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Preface 


This volume contains the Proceedings of the 3rd Stochastic Transport in Upper 
Ocean Dynamics Annual Workshop held on 26-29 September 2022. The workshop 
is part of the project “Stochastic Transport in Upper Ocean Dynamics” (STUOD) 
supported by an European Research Council Synergy Grant. The four principal 
investigators of the projects, Prof. Dan Crisan (ICL), Prof. Bertrand Chapron 
(IFREMER) Prof. Darryl Holm (ICL) and Prof. Etienne Memin (INRIA), were 
delighted to organize the annual workshop: the event was largest to date with an 
impressive range of inspiring talks, engaging theoretical and applied sessions, as 
well as networking opportunities. 

The STUOD project is supported by an ERC Synergy Grant, led by three world- 
class institutions: Imperial College London (ICL), National Institute for Research 
in Digital Science and Technology (INRIA) and the French Research Institute for 
Exploitation of the Sea (IFREMER). The project aims to deliver new capabilities 
for assessing variability and uncertainty in upper ocean dynamics and provide 
decision makers a means of quantifying the effects of local patterns of sea level 
rise, heat uptake, carbon storage and change of oxygen content and pH in the 
ocean. The project will make use of multimodal data and will enhance the scientific 
understanding of marine debris transport, tracking of oil spills and accumulation of 
plastic in the sea. 

As in the previous years, the 3rd STUOD Annual Workshop 2022 focused on a 
range of fundamental topical areas, including: 


1. Observations at high resolution of upper ocean properties such as temperature, 
salinity, topography, wind, waves and velocity 

2. Large-scale numerical simulations 

3. Data-based stochastic equations for upper ocean dynamics that quantify simula- 
tion errors 

4. Stochastic data assimilation to reduce uncertainty 


Each chapter in the present volume illustrates one or several of these topical 
areas. Many chapters offer new mathematical frameworks that are intended to 
enhance future research in the STUOD project. The workshop was held in the 
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hybrid mode and brought together participants from countries such as: UK, France, 
USA, Italy, Germany, Saudi Arabia, the Netherlands and Switzerland. It was well 
attended by early-career academics, post-graduate students, senior members of the 
community and other invited guests. 

The scientific program of the four-day conference was divided into five sessions 
covering Data Assimilation, Physics Models, Data, Numerics for Ocean Models, 
and Theoretical Analysis of SPDEs. Several members of the STUOD External 
Advisory Board, Prof Sebastian Reich (University of Potsdam), Prof Baylor Fox- 
Kemper (Brown University), Prof Rosemary Morrow (Laboratoire d’Etudes en 
Géophysique et Océanographie Spatiales), Prof Laurent Debreu (INRIA) and 
Prof Arnaud Debussche Ecole Normale Supérieure de Rennes, gave invited talks. 
The programme also included individual presentations by the STUOD principal 
investigators and post-doctoral researchers and snapshot presentations by invited 
speakers such as Dr Ali Mashayekhi (Imperial College London) and Dr Hamza 
Ruzaygat (King Abdullah University of Science and Technology). The workshop 
also provided opportunities for investigators at an early stage of their career to 
have discussions with established scientists, fostering potential future research 
collaborations, networking as well as inclusion and training of the next generation 
of researchers. 


Some of the in-person participants attending the event 
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The following is a brief description of the 15 contributions included in the 
proceedings: 

The submitted manuscripts include the chapter by Adrien Bella, Noé Lahaye, 
and Gilles Tissot entitled “Internal Tides Energy Transfers and Interactions with the 
Mesoscale Circulation in Two Contrasted Areas of the North Atlantic”. The chapter 
investigates the energy budget of the internal tide and its life cycle with a high- 
resolution numerical simulation and a vertical normal mode decomposition. Two 
areas of interest are considered: the Azores Islands over the mid-Atlantic ridge and 
the Gulf Stream offshore the North of the US East coast shelf break. Low mode (1 
and 2) internal tides are found to propagate from 100 km (mode 2) to 1000 km (mode 
1) away from their generation sites. Waves lose a significant portion of their energy 
as they propagate through the Gulf Stream, in contrast to the Azores domain. In the 
Gulf Stream domain, the mesoscale circulation is responsible for energy transfers 
from low to high mode internal tides, while the topographic scattering is dominant 
in the Azores area. This transfer of energy toward high modes favours energy 
dissipation. The mesoscale is significant in the energy budget of modes higher than 
mode 1 for both domains, and for all baroclinic modes in the Gulf Stream area. The 
internal tide is found to extract or lose energy toward the mesoscale circulation, but 
this accounts for less than 14% of the energy scattered from low internal tide modes 
to higher ones once summed over all contributions of the modal energy budget. 

The contribution of Paolo Cifani, Sagy Ephrati and Milo Viviani entitled “Sparse- 
Stochastic Model Reduction for 2D Euler Equations” introduces a reduction 
technique for ideal fluids. 

The 2D Euler equations are a simple but rich set of non-linear PDEs that 
describe the evolution of an ideal inviscid fluid, for which one dimension is 
negligible. Solving numerically these equations can be extremely demanding. 
Several techniques to obtain fast and accurate simulations have been developed 
during the last decades. In this chapter, the authors present a novel approach which 
combines recent developments in the stochastic model reduction and conservative 
semi-discretization of the Euler equations. In particular, starting from the Zeitlin 
model on the 2-sphere, they derive reduced dynamics for large scales and we close 
the equations either deterministically or with a suitable stochastic term. Numerical 
experiments show that, after an initial turbulent regime, the influence of small scales 
to large scales is negligible, even though a non-zero transfer of energy among 
different modes is present. 

Franco Flandoli, Silvia Morlacchi, and Andrea Papini present in their work, 
“Effect of Transport Noise on Kelvin—Helmholtz Instability”, a numerical inves- 
tigation of the dissipation properties of very small-scale transport noise. As a test 
problem, the authors consider the Kelvin-Helmholtz instability and compare the 
inviscid case, the viscous one, both without noise, and the inviscid case perturbed 
by transport noise. They observe a partial similarity with the viscous case, namely a 
delay of instability. 

The chapter by Daniel Goodair and Dan Crisan, entitled “On the 3D Navier- 
Stokes Equations with Stochastic Lie Transport’, investigates the existence and 
uniqueness of maximal solutions to the 3D SALT Navier-Stokes Equation in 
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velocity and vorticity form, on the torus and the bounded domain, respectively. 
The work partners an earlier paper of the authors as an application of the abstract 
framework presented there. They demonstrate the efficacy of the earlier work in 
showing well-posedness for both the velocity and vorticity form of the equation, as 
well as obtaining the first analytically strong existence result for a fluid equation 
perturbed by Lie transport noise on a bounded domain. 

Darryl Holm, Ruiao Hu, and Oliver Street in their paper entitled “On the 
Interactions Between Mean Flows and Inertial Gravity Waves" derive a generalized 
Lagrangian mean (GLM) theory as a phase-averaged Eulerian Hamilton variational 
principle expressed as a composition of two smooth invertible maps. Following 
an earlier work of Holm, they consider 3D inertial gravity waves (IGWs) in the 
Euler-Boussinesq fluid approximation. The authors provide both deterministic and 
stochastic closure models for GLM IGWs at leading order in 3D complex vector 
WKB wave asymptotics. The chapter brings the earlier results of Holm at leading 
order into an easily assimilable short form and proposes a stochastic generalization 
of the wave mean flow interaction (WMFI) equations for IGWs. 

The contribution of Quentin Jamet, Etienne Mémin, Franck Dumas, Long Li, 
and Pierre Garreau entitled "Toward a Stochastic Parameterization for Oceanic 
Deep Convection" investigates parametrizations for ocean convections. Current 
climate models are known to systematically overestimate the rate of deep-water 
formation at high latitudes in response to too deep and too frequent deep convection 
events. The authors propose in this study to investigate a misrepresentation of deep 
convection in Hydrostatic Primitive Equation (HPE) ocean and climate models 
due to the lack of constraints on vertical dynamics. They discuss the potential 
of the Location Uncertainty (LU) stochastic representation of geophysical flow 
dynamics to help in the process of re-introducing some degree of non-hydrostatic 
physics in HPE models through a pressure correction method. The authors then 
test these ideas with idealized Large Eddy Simulations (LES) of buoyancy-driven 
free convection with the CROCO modelling platform. Preliminary results at LES 
resolution exhibit a solution obtained with our Quasi-nonhydrostatic (Q-NH) model 
that tends toward the reference non-hydrostatic (NH) model. As compared to a pure 
hydrostatic setting, our Q-NH solution exhibits vertical convective plumes with a 
larger horizontal structure, a better spatial organization, and a reduced intensity 
of their associated vertical velocities. The simulated Mixed Layer Depth (MLD) 
deepening rate is however too slow in our Q-NH experiment as compared to 
the reference NH, a behaviour that opposes to that of hydrostatic experiments of 
producing too fast MLD deepening rate. These preliminary results are encouraging, 
and support future efforts in the direction of enriching coarse resolution, hydrostatic 
ocean, and climate models with a stochastic representation of non-hydrostatic 
physics. 

The contribution of Alexander Lobbe, Dan Crisan, Darryl Holm, Etienne 
Mémin, Oana Lang, and Bertrand Chapron, entitled “Comparison of Stochastic 
Parametrization Schemes Using Data Assimilation on Triad Models", investigates 
stochastic parametrization schemes. In recent years, stochastic parametrizations 
have been ubiquitous in modelling uncertainty in fluid dynamics models. One source 
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of model uncertainty comes from the coarse-graining of the fine-scale data and is 
in common usage in computational simulations at coarser scales. In this chapter, 
the authors look at two such stochastic parametrizations: the Stochastic Advection 
by Lie Transport (SALT) parametrization and the Location Uncertainty (LU) 
parametrization. Whilst both parametrizations are available for full-scale models, 
the authors study their reduced order versions obtained by projecting them on a 
complex vector Fourier mode triad of eigenfunctions of the curl. Remarkably, these 
two parametrizations lead to the same reduced order model, termed the helicity- 
preserving stochastic triad (HST). This reduced order model is then compared with 
an alternative model which preserves the energy of the system, and which is termed 
the energy preserving stochastic triad (EST). These low-dimensional models are 
ideal benchmark models for testing new Data Assimilation algorithms: they are easy 
to implement, exhibit diverse behaviours depending on the choice of the coefficients, 
and come with natural physical properties such as the conservation of energy and 
helicity. 

Erwin Luesink and Bernard Geurts consider in the chapter “An Explicit Method 
to Determine Casimirs in 2D Geophysical Flows” a new method to construct 
Casimirs for geophysical flows. 

Conserved quantities in geophysical flows play an important role in the charac- 
terization of geophysical dynamics and aid the development of structure-preserving 
numerical methods. A significant family of conserved quantities is formed by 
the Casimirs: These are integral conservation laws that are in the kernel of the 
underlying Poisson bracket. The Casimirs hence determine the geometric structure 
of the geophysical fluid equations among which the enstrophy is well known. Often 
Casimirs are proposed on heuristic grounds and later verified to be part of the kernel 
of the Poisson bracket. In this work, the authors explicitly construct Casimirs by 
rewriting the Poisson bracket in vorticity-divergence coordinates thereby providing 
explicit construction of Casimirs for 2D geophysical flow dynamics. 

The work of Igor Maingonnat, Gilles Tissot, and Noé Lahaye is entitled 
“Correlated Structures in a Balanced Motion Interacting with an Internal Wave” 
and investigates the correlations between a balanced motion and the incoherent part 
of a wave in an idealized configuration. 

Characterizing the loss of coherence of an internal tide propagating through 
mesoscale turbulence has been a major challenge in oceanography, particularly due 
to its implications for the interpretation of satellite data. The authors introduce a new 
modal decomposition technique, named broad-band proper orthogonal decomposi- 
tion (BBPOD), which consists in performing a proper orthogonal decomposition 
(POD) on complex demodulated variables. After connecting BBPOD to the standard 
SPOD, they show that BBPOD, coupled with the extended POD technique, enables 
them to associate the principal components of the incoherent field to the slow flow 
structures responsible for this loss of coherence through triadic interactions with the 
incident wave. 

The chapter by Etienne Mémin, Long Li, Noé Lahaye, Gilles Tissot, and Bertrand 
Chapron, entitled “Linear Wave Solutions of a Stochastic Shallow Water Model” 
investigates the wave solutions of a stochastic rotating shallow water model. The 
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approximate model covered in the chapter provides an interesting simple description 
of the interplay between waves and random forcing ensuing either from the wind or 
coming as the feedback of the ocean on the atmosphere and leading in a very fast 
way to the selection of some wavelength. This interwoven, yet simple, mechanism 
explains the emergence of typical wavelength associated with near-inertial waves. 
Ensemble-mean waves that are not in phase with the random forcing are damped 
at an exponential rate, whose magnitude depends on the random forcing variance. 
Geostrophic adjustment is also interpreted as a statistical homogenization process 
in which, in order to conserve potential vorticity, the small-scale component tends 
to align to the velocity fields to form a statistically homogeneous random field. 

The contribution of Said Ouala, Bertrand Chapron, Fabrice Collard, Lucile 
Gaultier, and Ronan Fablet entitled “Analysis of Sea Surface Temperature variability 
Using Machine Learning” analyses sea surface temperature (SST). SST is a 
critical factor in the global climate system and plays a key role in many marine 
processes. Understanding the variability of SST is therefore important for a range of 
applications, including weather and climate prediction, ocean circulation modelling, 
and marine resource management. In this study, the authors use machine learning 
techniques to analyse SST anomaly (SSTA) data from the Mediterranean Sea over 
a period of 33 years. The objective is to best explain the temporal variability of the 
SSTA extremes. These extremes are revealed to be well explained through a non- 
linear interaction between multi-scale processes. The results contribute to better 
unveil factors influencing SSTA extremes, and the development of more accurate 
prediction models. 

The contribution of Sebastian Reich entitled “Data Assimilation: A Dynamic 
Homotopy-Based Coupling Approach” covers a new approach for data assimilation 
based on homotopy. Homotopy approaches to Bayesian inference have found 
widespread use especially if the Kullback—Leibler divergence between the prior 
and the posterior distribution is large. The author extends one of these homotopy 
approach to include an underlying stochastic diffusion process. The underlying 
mathematical problem is closely related to the Schródinger bridge problem for given 
marginal distributions. He shows that the proposed homotopy approach provides 
a computationally tractable approximation to the underlying bridge problem. In 
particular, the implementation builds upon the widely used ensemble Kalman filter 
methodology and extends it to Schródinger bridge problems within the context of 
sequential data assimilation. 

Valentin Resseguier, Yicun Zhen, and Bertrand Chapron show in their work 
entitled “Constrained Random Diffeomorphisms for Data Assimilation" that both 
the Stochastic Advection by Lie Transport (SALT) and the Location Uncertainty 
(LU) equations can be recovered using a prescribed definition of a random diffeo- 
morphism used to perturb the physical space. However, unlike the SALT and LU 
settings, they propose a perturbation scheme does not directly rely on a particular 
physics. Hence, the random mapping is more flexible and can be applied to any 
PDE. 

The work of Gilles Tissot, Étienne Mémin, and Quentin Jamet entitled “Stochas- 
tic Compressible Navier-Stokes Equations Under Location Uncertainty" provides 
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a stochastic version under location uncertainty of the compressible Navier-Stokes 
equations. To that end, some clarifications of the stochastic Reynolds transport 
theorem are given when stochastic source terms are present in the right-hand 
side. The authors apply this conservation theorem to density, momentum, and total 
energy in order to obtain a transport equation of the primitive variables, i.e. density, 
velocity, and temperature. They show that performing low Mach and Boussinesq 
approximations to this more general set of equations allows to recover the known 
incompressible stochastic Navier-Stokes equations and the stochastic Boussinesq 
equations, respectively. Finally, they provide some research directions for using this 
general set of equations in the perspective of relaxing the Boussinesq and hydrostatic 
assumptions for ocean modelling. 

Francesco Tucciarone, Etienne Mémin, and Long Li present in their work entitled 
“Data-Driven Stochastic Primitive Equations with Dynamic Modes Decomposition” 
an implementation of a stochastic version of the primitive equations into the 
NEMO community ocean model to assess the capability of the so-called Location 
Uncertainty framework in representing the small scales of the ocean flows. The 
work is important as planetary flows are characterized by interaction of phenomena 
in a huge range of scales, and it is unaffordable today to resolve numerically the 
complete ocean dynamics. 

Finally, the STUOD Organizing Committee would like to acknowledge the 
financial and in-kind support received from several sources: the European Research 
Council (ERC) under the European Union’s Horizon 2020 Research and Innovation 
Programme (ERC, Grant Agreement No 856408)—for providing funds to cover the 
travel expenses of the invited speakers, catering costs, and administrative support; 
Imperial College London—for offering the conference venue. 
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Internal Tides Energy Transfers A 
and Interactions with the Mesoscale Checker| 
Circulation in Two Contrasted Areas 

of the North Atlantic 


Adrien Bella, Noé Lahaye, and Gilles Tissot 


Abstract The energy budget of the internal tide and its life cycle is investigated 
with a high resolution numerical simulation and a vertical normal mode decomposi- 
tion. Two areas of interest are considered: the Azores Islands over the mid Atlantic 
ridge and the Gulf Stream offshore the North of the US East coast shelf break. 
Low mode (1 and 2) internal tides are found to propagate from 100 km (mode 2) 
to 1000 km (mode 1) away from their generation sites. Waves loose a significant 
portion of their energy as they propagate through the Gulf Stream, in contrast to the 
Azores domain. In the Gulf Stream domain, the mesoscale circulation is responsible 
for energy transfers from low to high modes internal tides, while the topographic 
scattering is dominant in the Azores area. This transfer of energy toward high modes 
favours energy dissipation. The mesoscale is significant in the energy budget of 
modes higher than mode 1 for both domains, and for all baroclinic modes in the 
Gulf Stream area. The internal tide is found to extract or loose energy toward the 
mesoscale circulation, but this accounts for less than 14%, of the energy scattered 
from low internal tide modes to higher ones once summed over all contributions of 
the modal energy budget. 


Keywords Internal tide - Mesoscale flow - Energy budget - Vertical modal 
decomposition - High resolution numerical simulation 


1 Introduction 


Internal tides are a category of oceanic internal inertia-gravity waves that are 
generated when the astronomical tide interacts with topographic features such as 
shelf breaks, ridges or seamounts. They are encountered in many areas in the ocean 
[Zaron et al. 2022, Zhao et al. 2016, among others] and are of crucial importance for 
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the mixing of deep water and the closure of the Meridional Overturning circulation 
[Munk and Wunsch 1998]. The role of mesoscale currents for the energy pathway 
starting from the barotropic tide input and leading to this mixing is not yet fully 
understood. As a different thread of motivation, internal tides exhibit the same 
spatial scale as (sub-)mesoscale features, which renders challenging the estimation 
of the geostrophic velocities and the disentanglement of the two types of motions 
from satellite altimetry [Ponte and Klein 2015]. This problem is exacerbated by the 
ability of the new SWOT mission to obtain a resolution of 20 km to 30 km, compared 
to the approximately 100 km of current altimeter products [Ballarotta et al. 2019]. It 
is therefore needed to better understand the dynamics of the internal tide if we hope 
to disentangle it from the geostrophic field via data assimilation for instance. 

Since a decade or so [e.g. Arbic et al. 2010], global and regional realistic 
simulations have reached a sufficient resolution to be able to represent both internal 
tides and mesoscale features, opening the way to studying their interactions. 
Most studies focusing on these interactions use the vertical modal decomposition 
framework [Gill 1982], which allows to get a clear separation of the barotropic 
and baroclinic tides as well as alleviating the computational cost associated with 
processing 3D simulation outputs. 

The importance of the mesoscale circulation and buoyancy field have been shown 
by Kelly and Lermusiaux [2016] in a realistic configuration over the Gulf Stream 
area for the energy budget of the first baroclinic mode (denoted mode 1). They 
quantified the energy transfers between the mode 1 and the mesoscale circulation 
as well as its relative importance compared to topographic effects, showing that 
the mesoscale explains first order interactions pattern visible on the mode 1 energy 
flux divergence. Similarly, Pan et al. [2021] have analysed the energy budget of the 
mode 1 in a realistic setup including the mesoscale circulation and heterogeneous 
buoyancy field in the Gulf Stream area and the Mallau Island, leading to the same 
conclusion concerning the importance of mesoscale—internal tides interactions. 
Both studies have also considered realistic simulations with baroclinic currents, 
and have used a modal decomposition approach to define their energy budget. The 
deviation of internal tide energy flux have been analysed by Duda et al. [2018], 
indicating a refracting behaviour of a Gulf Stream like current on beams of internal 
tides energy fluxes. Dunphy and Lamb [2014] have shown in an idealised setup that 
the interactions of the first baroclinic internal tide mode with the barotropic eddy and 
the first baroclinic eddy mode lead to a modification of the energy flux propagation 
and a scattering of energy toward higher modes, respectively. 

In the present paper, our focus will be on the energy budget yielded by a modal 
analysis and we will in particular extend the work of other studies to the 10 first 
baroclinic modes, as well as studying the couplings between modes. 

The contribution of the mesoscale flow and buoyancy field will be taken into 
account and the relative importance of the coupling between modes as well as 
their spatial pattern will be quantified in an area where topographic features are 
prominent, and another area with a strong mesoscale circulation. 

The document is organised as follows: Sect.2 will explain the theoretical 
framework with the modal decomposition and the governing dynamical equations. 
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Section 3 will detail the dataset used and computational techniques. Section 4 will 
present results. Section 5 closes this article with a conclusion. 


2 Governing Equations and Energy Budget 


To investigate the modal internal tide energy budget, we rely on linearized equations 
of motions projected on vertical normal modes. The derivation follows previous 
studies (e.g. Kelly and Lermusiaux [2016]) and is only briefly depicted here. It starts 
from the hydrostatic primitive equations under the Boussinesq approximation with 
a free surface [e.g. Vallis 2017]: 


dun T u- Vun + fe; ^ un = —Vnp — V Tide, (1a) 
ip=b =O, (1b) 

à,b-u- Vb — 0, (1c) 

V-u=0, (1d) 


where u is the velocity, b the buoyancy, p the reduced pressure (divided by po, the 
reference density), and the index h denotes horizontal component of a 3D vector. 
Other variable names follow standard notations. Here, the forcing and dissipation 
terms have been omitted on purpose, except for the tidal potential term Tie. 

Next, the flow is decomposed into a low-frequency mesoscale flow and a high- 
frequency component that includes the internal tide: i.e., for the velocity, we split 
u = U + ü. We will further decompose the low frequency part of the buoyancy 
into a long-time mean and a slowly variable part: B = B + B’ ( denotes long 
time average), and introduce the Brunt-Väisälä frequency: N? = 0,B and N? = 
8, B . Subtracting the low frequency equations (1) to the initial system and further 
assuming that nonlinearities amongst the high-frequency flow are negligible, one 
obtains the following linear system of equations: 


9,4 JU. Vu - u- VU + fe, ^üà = —Vp — Vide, (2a) 

8.p — b — 0, (2b) 

9b +U - Vb + ün- V4(B +B + 0N? 4- N?) — 0, (2c) 
V.ü-0 (2d) 


Note that, since the above system of equations is linear in the high-frequency 
variables, and assuming that most of the mesoscale variability has a timescale longer 
or equal to the cutoff period used in the internal tides complex demodulation (3 
days), band-pass filtering of these equations leaves the system unchanged. Hence, 
in our study, we will focus on the semi diurnal (including M2 and S2 frequencies) 
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internal tide with a period centred on 12.2h, relying on the above set of equations 
where the high-frequency variables describe this frequency band. These equations 
are completed by the usual linearized boundary conditions at the mean surface 7 
and the bottom at z = —H: 


pGp = gj. 3) 
wm) = 9,0) + Ur) - Va), (4) 
w = —u;,(—H)- VH. (5) 


We thus have the internal tide part of the primitives equations that are now ready 
to be projected on a set of verticals modes. The latter are given locally by the 
standard Sturm-Liouville problem for internal waves in a rotating stratified ocean 
[e.g. Gill 1982, Vallis 2017], assuming a flat ocean, a horizontally-homogeneous 
stratification profile and no background flow: 


®, Py, : t d v2 — 
hz +z cd with ®, = Oatz = —H, and g@, + N^, = 0atz — 1, 
(6) 


n 
where prime denotes vertical derivative. The obtained set of vertical modes ®ņ is 
complemented by another set of vertical functions which obey: 


Pn = Dn, ®,,' = 773 9n (7) 


and these modes follow the orthogonality conditions: 


7 7 N2 g _ _ 
f Py Pp dz = / —3 mn dz + -3> Pm Mn N) = Hôm. (8) 
—H -H Cy C5 
Physical fields can then be expanded over these bases through: 


lun, p] — Y us. Pal Pn, (9) 


n 


[w, b] = Y Iw, N7balgn, (10) 


n 


and the modal projection coefficients are obtained as follows: 
[Un, Pn] = ([un, p]. (11) 


wn = -z (fon wÑ?) + won), (12) 
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1 = 
br= 5 c b) + -= eq) , (13) 
€ n 


with the inner product defined as such: (f, g) = + Jla f (z)g(z) dz. Note that the 
two bases have a parametric dependency on the horizontal coordinates because of 
the varying topography H (x, y) and stratification profile N?(z; x, y). In this paper, 
we are going to consider the modes 0 to 10, with the mode 0 being the barotropic 
tide, and all others being the baroclinic internal tide. 

The dynamical equations are then obtained by projecting the tidal momentum 
equations and continuity on a mode @,,, and buoyancy equations on the mode Øm, 
using the relations exposed above plus integration by part and Leibniz formula, 
and substituting the buoyancy modal amplitude by the pressure anomaly (after 
projection of the hydrostatic balance on Øm) [Kelly and Lermusiaux 2016, Kelly 
2016, Pan et al. 2021]. Omitting the * notation from now on, one obtains—after a 
tedious but straightforward derivation: 


O;Um + VhaPm F fé; ^ lg = — >. Umn i Vu, — X ustios 
n n 
V 
= p» Un - U mn n X Wn Uin 
n n 


1 " 
— D2 PaTnn — zV Mide (On G]) — 9m C- H)), 
n 


(14a) 


ðt Pm — Co Wm = -E 9m TU T -Vn- bt Ub -Vpn 
n 


N2 
— 3 Pn (on: Un: V (e) 
n Cn 


zu > ün : (Ban — By) F x Un lon. Pa N?) , 
n n 


(14b) 
Vj (Hum) + Hwy = H uta. (14c) 
n 
with: 
Tmn = (On, Van) , Umn = (Bm, Un®n) E ue = (Dn, Un i: VinPn) , 


N? 
Uy, = (m, 6, VUn), URA = (Pn, $5 0; U 5) , Ub. = (on: Ura on) , 
n 


Bmn = (Pm, P, V B) , By = (em. $, V, B) : 
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We then take the scalar product of the projected momentum equation (14a) 
with u,, multiply the pressure equation (14b) with pm and substitute wn using the 
continuity equation (14c) to obtain the energy budget of the internal tide mode m in 
presence of a slowly varying mesoscale circulation and buoyancy field: 


temporal variation of energy 
—. 


8 (ur 2 1 
ar ( + 73 + gyr’ (Aum Pm) = 


divergence of energy flux 


advection by the background flow 
eua ai aaaaaaaaaaaaaaaaasaaasasaasasasssasssssssMsa 


= y» | um - (Umn > Vun) — Um UnU, + > Ubi V ps 


n m 


Background flow shear 


A 


V 
— Un: Unn ‘Um + Um: Ux nWn 


horizontal gradient of background buoyancy slowly variable stratification 


Ooo 
Pm P 
+ 7a" ` (Bmn + Bmn) + ~ Un lon. N*g,) 
Hc, Cm 
Topography and mean stratification: Cmn 
fe FF 
+ PmUn* Tam — PnUm’ Tmn | + Rn (15) 


with 


advection of ssh by the background flow 


ji Pm 
Hd 


Rm = 


£95 G)Un M) - V 


advection of mean stratification by the background flow 


N? 
T ] a Pn (on. Un: Vn ( 2 on) 
c2 e 
n n 


m 


Tidal potential work 


T —————— 
— Um: V Hiige m (7) — Pn (—H)) E (16) 


By anticipation of the results in Fig.3 in Sect.4.2, the term Rm gathers all 
contributions that are not of primary importance (and will not be discussed in this 
paper). 

This equation describes the different energy exchanges (coupling) between 
modes as well as energy sources and sinks. Most coupling terms can be represented 
with coupling matrices -noted Kj, here—that will be shown below in Sect. 4.2 
(Fig. 2). Generally, this coupling matrix contains an anti-symmetric part, describing 
net exchanges of energy between modes, and a symmetric part, with the diagonal 
included, that describes gain/loss of energy for the internal tide. 
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The first three terms in the right-hand side (RHS) arise from the presence of 
a mesoscale flow, while the next two ones include contributions from associated 
buoyancy perturbations as well as seasonal variations. The term labelled as slowly 
variable stratification arises during the derivation of the CSW buoyancy equa- 
tion (14) because of our choice to construct the modal bases by using a stationary 
profile of stratification in the problem (6). In case of a definition of the vertical 
modal bases with the instantaneous stratification profile, this term would vanish. 
Indeed, for a linear propagation of an internal wave without topography, mesoscale 
and dissipation, the vertical modes diagonalise the operator, thus decouples the 
components from each other. By time-averaging the stratification profile, the 
associated vertical mode basis becomes inadequate to perform this decoupling. The 
slowly variable stratification term then arises, taking into account in the energy 
equation coupling between modes associated with this linear propagation. This term 
can be interpreted as energy exchanges between modes during a linear propagation 
of an internal wave without topography, mesoscale and dissipation. Since vertical 
modes are eigenfunctions of this associated idealised operator, we associate the 
variable stratification term to the inadequacy of the basis to represent internal 
wave dynamics (through the time-averaged stratification profile). We do not expect, 
however, that using a basis defined with a slowly time-varying stratification would 
change significantly the magnitude of the dominant interaction terms discussed 
in this paper (see e.g. Sect. 4.2), and in particular the mesoscale and topographic 
contributions. 

Finally, Cmn = —Cnhm is the topographic scattering, which includes the 
generation of baroclinic tide from the barotropic tide (Cmo). Note that this term 
has no symmetric part—it can only redistribute energy amongst the vertical modes. 

Once averaged in time over a sufficiently long period (typically, one month), the 
time-variation of the energy vanishes and the above modal energy equation reduces 
to the following, simplified form: 


Vak in == 5 Kmn F Rm, (17) 


n 


where F„ denotes the modal energy flux, Kin, gathers the energy transfers, sources 
and sinks that are not negligible and Rm gathers all the neglected terms. These 
include a contribution from the free surface and the tidal potential, which would both 
vanish under a rigid-lid assumption for the baroclinic modes—in agreement with the 
fact that we find they have negligible magnitude in our diagnostics (cf. Sect. 4). The 
remaining term in Rm involves horizontal variations of the modes and associated 
background stratification, which is small since a year average is used. 
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3 Data and Method 


3.1 eNATL60 Simulation 


The high-resolution realistic simulation of the North Atlantic Ocean eNATL60 
[Brodeau et al. 2020] is used to diagnose the energy budget of the internal tide 
following Eq. (15). It uses the Nucleus for European Modelling of the Ocean 
(NEMO) model, which solves the primitive equations under the Boussinesq and 
hydrostatic approximations with an Arakawa C grid and z-coordinate with partial 
step [Madec and the NEMO team 2008]. The eNATL60 run includes astronomical 
tidal forcing with the M2, S2, N2, OI and K1 frequencies. In addition, surface 
forcing from the 3-hourly ERA-interim (ECMWF) reanalysis is used, which enables 
the simulation to develop a realistic mesoscale field. The horizontal resolution of 
the simulation is 1/60? (about 1.5 km in the mid latitudes) and features 300 vertical 
levels with a thickness starting from less than 1 m at the top of the ocean to 100m 
at the bottom. We processed the hourly outputs of the horizontal velocity u, sea 
level, temperature and salinity fields from which we computed the pressure and 
stratification. 

We will use the 2009 October month over two subdomains representative of 
different dynamical regimes of the ocean. The first one is centred over the Azores 
Islands and the North Mid-Atlantic Ridge. The second one is located offshore 
the North of the US East coast and includes a portion of the Gulf Stream. The 
Azores domain is characterised by the predominance of topographic features such 
as the mid Atlantic ridge exhibiting low amplitudes topographic variation over 
100 km, and a group of seamount with strong slope and a typical scale of 10 km. 
Comparatively to the Gulf Stream area, the mesoscale flow is weak. The Gulf 
Stream domain is marked by a strong mesoscale activity and a large continental 
slope leading to a flat abyssal plain with a few surrounding seamounts (see Fig. 1 
discussed below). 


3.2 Filtering and Computing Methods 


In order to obtain the vertical normal modes bases, the mean stratification N? is 
computed from the time average of temperature and salinity using the nonlinear 
equation of state from TEOS-10. The Sturm-Liouville problem (6) is then discre- 
tised and solved on the staggered vertical grid in each horizontal cell, giving the two 
modal bases on the T grid, at the centre (in the horizontal) of each cell. From there, 
the modal amplitude un, v, and p; are obtained by projecting the corresponding 
fields u and p, following (11)-(13). Since the horizontal grid is staggered, the 
horizontal velocity is located at the edge of each cell. We thus interpolate the 
base ®, before projecting the velocities. Unfortunately, this step induces a loss 
of orthogonality for the newly interpolated basis. The coefficients determined 
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Fig. 1 One-month average of the horizontal energy flux divergence for mode 1 (left column) and 
mode 3 (right column), for the Azores domain (top) and the Gulf Stream domain (bottom). A 
positive value indicates a gain of energy for the mode. Black arrows show the square root of the 
corresponding modal horizontal energy flux. A spatial Gaussian filter with a half modal wavelength 
kernel has been applied on these fields in order to remove the local alternate of positive and 
negative values and make the plot cleaner. Green streamlines indicate, qualitatively, the mesoscale 
horizontal currents at 52m depth, and isobaths at 1000, 2000, and 3000 m are superimposed in 
grey contours 


by innerproduct between the horizontal velocity fields and the interpolated basis 
are corrected a posteriori by inverting the cross-correlation matrix between the 
interpolated basis and the local basis issued from the Sturm-Liouville problem (6) on 
the u, v-grid. This allows us to obtain the projection coefficients u, and v, defined 
on the u, v-points. Vertical velocity is then computed using the continuity equation 
formulated with the discrete scheme employed in the NEMO code [Madec and the 
NEMO team 2008]. 

The next step consists in separating the mesoscale contribution from the semi 
diurnal internal tide. The latter is extracted by means of a complex demodulation 
at a central semi-diurnal frequency œ = 1.415x10^^rads! and a low-pass 
cutoff cutting period of 3 days. The mesoscale flow is directly low-pass filtered 
with the same cutting period. The mesoscale slowly-variable buoyancy fields is 
computed from the daily-averaged temperature and salinity, and the corresponding 
stratification is then estimated from this buoyancy field and a mean sound speed 
profile [using standard formula, see e.g. Vallis 2017]. 

Since our modal decomposition is inaccurate in areas of shallow water, a mask 
is applied to remove all locations shallower than 300 m in the Azores domain, and 
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250 m in the Gulf Stream domain. The mask is slightly shallower in the Gulf Stream 
domain in order to retain the topographic generation occurring at the shelf break. 

Since high modes are expected to propagate over shorter distances from their 
generation site compared to low modes, we usually group the modes 4—10 together, 
and interpret them as part of dissipative processes in the energy budget. Finally, 
all terms of Eq. (15) display small scale patterns, particularly pronounced when the 
mesoscale and internal tides interact. These patterns are not of particular physical 
significance for the diagnostics discussed in this paper. Therefore, we apply a 
Gaussian spatial filter before displaying any term in (15), with a kernel width equal 
to 25 grid points (a little less than 40 km). This procedure is only applied for map 
plotting. 


4 Results 


The energy budget of internal tides in both domains are examined in this section, 
first by considering the geographical distribution of their generation, propagation 
and sinks, and secondly by quantifying the spatially integrated and temporally 
averaged energy budget. 


4.1 Life Cycle of the Internal Tide 


Both the Azores Islands and Gulf Stream domains are sites of powerful internal 
tides generation, exposed to contrasted background conditions. The divergence 
of the energy flux averaged in time over one month (for the period of October 
2009) is shown in Fig. 1, along with the mesoscale currents at 52 m depth and the 
topography. In the Azores area, mode 1 energy flux originates from the seamounts 
and propagates far away (at least 1000 km), indicating a moderate energy loss during 
the propagation (Fig. 1, top left panel). Modes 2 (not shown) and 3 propagate over 
shorter distances. All dominant sources and sinks are located around the topography 
(visible in Fig. 1). For the third mode, we find a local loss of energy just next to the 
generation at the seamount, of the same order of magnitude but smaller value. This 
property holds for higher modes as well (not shown). In the Gulf Stream domain, 
in contrast, the input of energy for mode 1 is located at the shelf break, producing 
a strong offshore beam. This beam manifests strong energy loss as it propagates as 
can be seen in the negative patch displayed in the mode 1 energy flux divergence, 
Fig. 1, bottom left panel. A first hot spot of sinks for the mode 1 is directly next to the 
shelf-break, another being at the encounter with the strong north-eastward current. 
This beam is also refracted by the Gulf Stream, a behaviour previously reported in 
Duda et al. [2018]. As we will show in the next section (cf. Figs. 2 and 3), this 
loss of energy is mainly caused by the advection terms in Eq. (15) and is largely 
converted in mode 2 energy. Mode 3 (Fig. 1, bottom right panel) displays a more 
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complex repartition of sources and sinks, with an overall gain of energy at and near 
the shelfbreak and an overall loss of energy further away from the land, where it 
encounters the current. The sources and sinks of energy from modes | and 3 are 
therefore much less localised around the prominent topographic feature than in the 
Azores domain, and clearly exhibit strong interactions with the mesoscale currents. 


4.20 Importance of the Different Contributions in the Energy 
Transfers 


We now focus on quantifying the causes of the multiple interactions of the internal 
tide with its surrounding which results in the pattern in the energy flux divergence 
previously described (Fig. 1). 


4.2.1 Detailed View of Coupling Terms 


To this aim, the modal energy budget (15) is integrated over the two domains 
considered, averaged over the October month and the different contributions are 
displayed under their matrix form in Fig. 3. The matrix are decomposed into their 
anti-symmetric part (upper triangle) denoting a transfer of energy from mode to 
mode, referred afterwards as scattering, and symmetric part (diagonal plus lower 
triangle) showing sources or sinks for the internal tide. We have only plotted terms 
of (15) that are of first order for at least one mode in one domain. 

Amongst these terms, the topographic contribution is at least one order of 
magnitude higher than all others for the barotropic mode in both domains and for 
the first baroclinic mode in the Azores domain. However, for the Gulf Stream area, 
the advection coupling becomes of similar magnitude for the first mode and exceed 
the topographic scattering for modes 2 and 3. In the Azores domain, this is non 
negligible for modes 2 and 3 but never exceeds the topographic contribution. In 
this region, the second largest contribution is given by the variable stratification 
coupling for the baroclinic modes. In The Gulf Stream area, the mesoscale vertical 
shear production and the horizontal buoyancy gradient loss are also of first order. 
However, as already found in Kelly and Lermusiaux [2016], these two terms seem 
to compensate each other leaving a near zero energy gain for mode |. The same 
phenomenon approximately occurs for higher modes, cf. Fig. 2: the lower triangles 
of buoyancy and shear matrix cancel out. 

These dominant processes are dominated by their anti-symmetric part. The 
topographic contribution is purely anti-symmetric by construction, while the sum 
of the symmetric part for a given mode for the advection coupling is always smaller 
than 10% of the sum of the anti-symmetric part when this term is significant 
in regard to the topographic scattering. This quantity never exceeds 20% for the 
variable stratification coupling in the Azores domain. For the Gulf Stream domain 
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Fig. 2 Matrices Knm 
summarising the mean energy 
transfer integrated over the 
whole subdomains between a 
mode n (row) and a mode m 
(column). The upper-right 
triangle is the anti-symmetric 
part, and the lower-left 
triangle (diagonal included) 
the symmetric part. The sign 
convention follows Eq. (15): 
a positive value indicates a 
gain of energy for the 
reference—"receiving"— 
mode m from n mode and 
conversely. Modes 4 to 10 are 
grouped together. From top to 
bottom are the coupling 
caused by the topography, the 
internal tide advection by the 
background flow, the terms 
implying the horizontal and 
vertical shear of the 
background flow and the term 
implying the gradient of the 
buoyancy field (both variable 
and annual mean). Notice the 
different range of values in 
the colorbar for the 
topographic coupling terms 
compared to the other ones 
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however, it is close to one for the first baroclinic mode, indicating that symmetric 
and antisymmetric parts have comparable magnitudes. The background flow shear 
and buoyancy gradient couplings have a dominant symmetric part, but these two 
terms are cancelling each others. Last, if we sum all the contributions of the modal 
energy budget (15), the symmetric transfer of energy is found to be worth less or 
equal to 1496 for modes 1 , 3 in both domains and 2 in the Azores one. Therefore, 
the interaction between the mesoscale circulation and the internal tides mainly 
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contribute to a scattering of modes equivalent to the one produced by the topography 
without exchanges between the internal tides and the mesoscale circulation and 
buoyancy field. 

As a last comment, Fig. 2 shows that topographic scattering is the only process 
that significantly transfers energy between non-neighbouring modes. For the other 
dominant processes, the energy transfer toward non-neighbouring modes is one 
order of magnitude lower than the one toward a neighbouring mode, except for 
the variable stratification coupling in the Gulf Stream area. For the mesoscale 
contributions, this implies that only the third mode is able to transfer energy toward 
what we consider as part of dissipative processes. 


4.2.7 Modal Energy Budget 


We are now looking at a more aggregated view of (15) thanks to Fig.3 which 
shows the modal energy budget of mode 1, 2 and 3 for both domains. Here we 
have separated the energy transfers with the following decomposition: 


Topographic Scattering : Cnm = PmUn ` Tam — PnUm ` Tmn 

Advection of the internal tide : Anm = Um -(Umn - Vus) — Um ` Un UŽ, + P? Un . 
V Dn 

Shear of background flow : VUnm = Un - D. ‘Um cT Ug: UnnWn 

Horizontal gradient of buoyancy field : V Bam = iy Un - (Bmn + Bun) 

Surface contribution and advection of the mean stratification profile : Sym = 

- - E^ ve 

— dic sew Una) - Vij — 2E Y^, Pn (9m Un Vi (Sr) 


Variable stratification : Nam = P? wn (om, N7¢n) 
m 


We also have separated the symmetric part (*) of the antisymmetric one, and have 
furthermore divided the latter into an energy exchange coming from lower modes, 
and an energy exchange going from higher modes (**). The flux coming from lower 
modes is again divided into an energy transfer coming from the barotropic mode 
(****), and transfers coming from the baroclinic ones (***). A positive value means 
a transfer of energy toward the mode of interest. 

If we consider i the index of the mode of interest and Anm the energy transfer 
matrix already separated into symmetric and antisymmetric part as in Fig. 2, then 
we have : 


doeek — Ag 


i-l 
NE i D Ani 


nz! 
^k — =) niri Ain 
* = bm Ain un NN Ani 
Comparing the integrated value of the energy divergence flux, the Gulf-Stream 


domain is overall less energetic than the Azores domain: mode | has a divergence 
around 4 time larger in the last domain than in the first and modes 2 and 3 are also 
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Fig. 3 Mean modal budget for modes 1 (top), 2 (middle row) and 3 (bottom) for the Azores 
domain (left) and the Gulf Stream (right) domains, as described by Eq. (15). The divergence of 
the modal flux is plotted in red, while others contributions are separated into the symmetric part 
(green) and the anti-symmetric part where interaction with lower modes and with higher modes 
(orange) are separated. The anti-symmetric part coming from low modes is furthermore separated 
into fluxes coming from the barotropic mode (blue) and fluxes coming from the baroclinic modes 
(fuchsia). A positive value means a gain of energy for the mode. Processes taken into account are 
Topo : Cnm , Adv : Anm, shear : VUnm, ssh + Ns : Spm, Buo : V Bym , Strat : Nam 


more energetic in the Azores area. In the results displayed in Fig.3, the mode 1 
in the Gulf-Stream domain is the only one to have an overall negative energy flux 
divergence. 

The barotropic mode has an importance in the energy budget of mode 1 to 3 only 
through the topographic scattering, and is dominant in the energy budget of mode 
1 for both areas. However, it becomes less important for modes 2 and 3, with a 
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contribution to the topographic scattering toward mode 3 inferior to the one caused 
by modes 1 and 2, but still significant. 

Among the first order energy transfers exposed in all panels of Fig.3, none 
transfers energy from small scale to large scale. Mesoscale and topographic 
couplings therefore create a forward energy cascade. 

Last, Fig. 3 enables identifying the terms of the modal energy budget (15) that 
are of negligible importance. The advection of ssh and stationary stratification are 
negligible, which justifies gathering these terms in Eq. (15) in a small residual term. 
Similarly, the astronomical tidal potential (not included in Fig.3) only projects 
weakly on the baroclinic modes. The mean of temporal variations of potential and 
kinetic energy are also small: one month is enough to flatten the two weeks periodic 
variation introduced by the spring neap cycle. 


5 Conclusion 


The semidiurnal internal tide energy budget was characterised based on a vertical 
mode decomposition, allowing a detailed investigation of the energy transfers 
amongst different vertical scales—and associated horizontal scales—of the internal 
tide field. We showed that the Azores area, a region of weak mesoscale activity and 
strong topographic features, is essentially dominated by the topographic induced 
energy transfer from low to higher modes. While this effect is dominant for low 
modes, variable stratification and—to a lesser extent—advection of the internal 
tide by the mesoscale flow become of similar importance for high modes (equal 
or larger than 2). Some questions remain concerning the precise role of the variable 
stratification, in relation with the definition of the modal basis expansion and how 
this definition would change the energy exchanges between modes. In the Gulf 
Stream area, the importance of the advection of the internal tide by the background 
flow is comparable to the topographic scattering, and even greater for modes 2 and 
higher. The variable stratification term is less important. However, the net impact 
of the mesoscale circulation on the energetic of internal tide is mainly limited to a 
scattering of energy from the low to high-order modes. In comparison, exchange of 
energy between the internal tide and the mesoscale fields are negligible: the energy 
exchanged by modes | to 3 with the background is always less than 14% of the total 
energy transfer for the respective mode. 

The type of modal analysis conducted in the present paper on a high resolution 
primitive equation simulation is able to give some insights on the energetic impact of 
mesoscale circulation and its associated buoyancy field on the internal tide and the 
transfer of energy from the astronomical tide toward dissipation. This work could be 
extended by investigating the deep water mixing caused by these interactions. We 
have in particular overlooked for now the internal tide dissipation, but it is worth 
investigating in the near future. 

Finally, the temporal variability of mesoscale—internal tides interactions will 
also be studied in the near future. 


16 A. Bella et al. 


References 


B. K. Arbic, A. Wallcraft, and E. Metzger. Concurrent simulation of the eddying general circulation 
and tides in a global ocean model. Ocean Modelling, 32:175—187, 2010. 

M. Ballarotta, C. Ubelmann, M-I. Pujol, G. Taburet, F. Fournier, J-F. Legeais, Y. Faugére, 
A. Delepoulle, D. Chelton, G. Dibarboure, and N. Picot. On the resolutions of ocean altimetry 
maps. Ocean science, page 1091-1109, 2019. 

L. Brodeau, J. Le Sommer, and A. Albert. Ocean-next/eNATL 60: Material describing the set-up 
and the assessment of NEMO-eNATL60 simulations. Zenodo, September 2020. 

T. F. Duda, Y.-T. Lin, M. Buijsman, and A. E. Newhall. Internal tidal modal ray refraction and 
energy ducting in baroclinic gulf stream currents. Journal of Physical Oceanography, 48, 
2018. 

M. Dunphy and K. G. Lamb. Focusing and vertical mode scattering of the first mode internal tide 
by mesoscale eddy interaction. Journal of Geophysical Research: Oceans, 33:523—536, 2014. 

A. E. Gill. Atmosphere-Ocean dynamics. International Geophysics Series, Academic Press, Vol 
30, 1982. 

S. M. Kelly. The vertical mode decomposition of surface and internal tides in the presence of a 
free surface and arbitrary topography. American Meteorological Society, Volume 46, Issue 12: 
3777-3788, 2016. 

S. M. Kelly and P. F. J. Lermusiaux. Internal-tide interactions with the gulf stream and middle 
atlantic bight shelfbreak front,. Journal of Geophysical Research: Oceans, 121:6271—6294, 
2016. 

G. Madec and NEMO ocean engine the NEMO team. Note du Póle de modélisation. Institut 
Pierre-Simon Laplace (IPSL), France,, No 27, ISSN No 1288-1619., 2008. 

W. Munk and C. Wunsch. Abyssal recipes ii: energetics of tidal and wind mixing. Deep-Sea 
Research, Volume 45, Issue 12:1977—2010, 1998. 

Y. Pan, P. J. Haley, and P. J. F. Lermusiaux. Interactions of internal tides with a heterogeneous and 
rotational ocean. Journal of Fluid Mechanics, 920, 2021. 

A. L. Ponte and P Klein. Incoherent signature of internal tides on sea level in idealized numerical 
simulations. Geophysical Research Letters, Volume 42:1520—1526, 2015. 

G. K. Vallis. Atmospheric and Oceanic Fluid Dynamics: Fundamentals and Large-Scale 
Circulation. Cambridge University Press, 2017. 

E. D. Zaron, R. C. Musgrave, and G. D. Egbert. Baroclinic tidal energetics inferred from satellite 
altimetry. Journal of Physical Oceanography, page 1015-1032, 2022. 

Z. Zhao, M. H. Alford, J. B. Girton, L. Rainville, and H. L. Simmons. Global observations of open- 
ocean mode-1 M2 internal tides. Journal of Physical Oceanography, 46:1657—1684, 2016. 


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, 
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate 
credit to the original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter's Creative 
Commons license, unless indicated otherwise in a credit line to the material. If material is not 
included in the chapter's Creative Commons license and your intended use is not permitted by 
statutory regulation or exceeds the permitted use, you will need to obtain permission directly from 
the copyright holder. 


Sparse-Stochastic Model Reduction A 
for 2D Euler Equations or | 


Paolo Cifani, Sagy Ephrati, and Milo Viviani 


Abstract The 2D Euler equations are a simple but rich set of non-linear PDEs 
that describe the evolution of an ideal inviscid fluid, for which one dimension is 
negligible. Solving these equations numerically can be extremely demanding. Sev- 
eral techniques to obtain fast and accurate simulations have been developed during 
the last decades. In this paper, we present a novel approach that combines recent 
developments in stochastic model reduction and conservative semi-discretization of 
the Euler equations. In particular, starting from the Zeitlin model on the 2-sphere, 
we derive reduced dynamics for large scales and we close the equations either 
deterministically or with a suitable stochastic term. Numerical experiments show 
that, after an initial turbulent regime, the influence of small scales to large scales 
is negligible, even though a non-zero transfer of energy among different modes is 
present. 


1 Introduction 


The 2D Euler equations are a fundamental model for ideal fluids [10]. During 
the last two centuries, these equations have stimulated an intense activity both 
in terms of mathematics and physics (see for example the seminal works of 
Helmholtz and Arnol’d [14, 2]). In computational science and numerical analysis, 
retaining at a discrete level the rich non trivial structure of these equations is still 
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a challenging problem [1, 19]. One main computational issue is the “curse of 
dimensionality”. Indeed, turbulent phenomena vary in different spatial and time 
scales and the distribution of energy over a vast range of scales of motion makes 
it computationally infeasible to fully resolve the flow in a numerical simulation. 
A well-established technique to mitigate large computational costs is large-eddy 
simulation (LES), where a spatial filter is applied to the governing equations after 
which only the large scales of motion are resolved [13, 21]. The filter may be 
defined explicitly, as a smoothing function where the filter width determines the 
level of detail left in the filtered solution and is chosen by the user, or implicitly, 
by coarsening the discretization operator. Either approach requires a closure model 
to represent the effect of the filtered scales on the unfiltered resolved scales. In 
this paper, we apply an explicit spectral cut-off filter to the 2D Euler equations, 
where the cut-off frequency is based on observed intrinsic scale separation. This 
filter leads to a discrete problem formulation in which matrix sparsity can be 
exploited to reduce computational costs. At the same time, this formulation still 
allows for an explicit representation of the effects of small scales on large scales. 
The equations are closed by a deterministic or stochastic model term, based on 
high-resolution measurements. Here, we choose to model small-scale flow features 
using a stochastic term mimicking high-resolution numerical data. Subsequently, 
we analyze the model performance by means of energy fluxes between the scales 
of motion. The proposed stochastic closures and assessment of energy fluxes in the 
high-resolution data can serve as a point of departure for further development of 
stochastic closure models. 

A peculiar aspect of 2D ideal fluids is the presence of infinitely many conserva- 
tion laws. In particular, as firstly described by Kraichnain [18], the conservation of 
energy and enstrophy (the L? norm of the curl of the velocity field) implies a double 
cascade phenomenon: the energy tends to move from small scales to large scales, 
whereas the enstrophy tends to follow the opposite direction. Hence, in terms of 
the curl of velocity, or vorticity, it is possible to clearly separate two regimes: one 
slowly evolving at large scales and one fast at small scales. This was shown to hold 
numerically in [20] for the Euler-Zeitlin model on the sphere. Theoretically, the 
study of non-deterministic fluid models for different regimes have gained interest in 
the SPDE community [12]. The equations studied in [12] and the results proved 
therein, show a precise connection between different space-time regimes with a 
reduced model for large scales. Indeed, it is shown that a suitable model for large 
scales is given by the so-called Stochastic Advection by Lie Transport (SALT) 
equations [15], in which a transport noise term models the infinitesimal action of the 
small scales on the large ones. Several numerical tests have shown the usefulness of 
the SALT equations as a powerful tool for model reduction [6, 9]. 

However, defining precisely what large and small scales are is still an open 
problem. In this paper, we present a criterion for defining large scales in terms 
of truncation of the Fourier expansion. We point out that other choices and 
interpretations of large and small scales are possible (see for example [20]). Let 
us first introduce the governing equations for the vorticity field w, defined on the 
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2-sphere S? embedded in R?: 


o = (v,oj 
Ay — o. (0) 


The Poisson bracket is defined as 
(V, o} = Vy -Vto 


and the Laplacian is the Laplace-Beltrami operator on S?. As mention above, 
equations (1) have infinitely many first integrals: energy H(w) = 1 ie Vo, 
Casimirs Cy(w) = Jy œ", for n > 1, and angular momentum. Understanding the 
role played by these invariants is still an open problem, especially for the long-time 
evolution of the fluid [7]. 

In order to gain numerical insight on this question, V. Zeitlin proposed a spatial 
discretization of (1), which retains many of the first integrals above [22, 23]. The 
Euler—Zeitlin equations are defined as follows: 

= : 

NL = . 

Here W is a N x N skew-Hermitian matrix with zero trace, that is, an element 
of the Lie algebra su(N). The bracket [ P, W] is the usual matrix commutator and 
the discrete Laplacian Ay is defined such that its spectrum is a truncation of the 
spectrum of A [17]. As mentioned above, the Euler-Zeitlin equations possess the 
following integral of motions: energy H(W) — TTr(P W), Casimirs C,(W) — 
Tr(W”), for n = 2,..., N, and angular momentum. The core of the Zeitlin model 
is how the original vorticity c and the discrete one W are linked. Indeed, the 
representation theory of SU (2) provides a deep connection between the discrete 
Laplacian Ay and a particular basis {Tim} of su(N), for? = 1,..., N — 1 and 
m = —l,...,m[17,4]: 


e each Tim is an eigenvector of Ay, with eigenvalue —/(/ + 1), 

* for each N > 1, there exists a linear map pw : C™%(S*) > su(N), defined 
via the (real) spherical harmonics basis {Yj} as py (Yim) = Tim, if and only if 
1<N-1, 

* Ipy(V. o) — N2/?[pyv, pxo]l — 0, for N — oo, where the norm is the 
operator norm. 


The classical way to determine large and small scales is to choose a wave number 
I as a threshold for the large scales (see for example [3, 6]). In this work, we propose 
the following criterion to set the threshold /. Consider a time scale in which the 
fluid's energy spectrum profile has reached a stationary state. Then, typically (that 
is, out of equilibrium) the spectrum exhibits a double slope, which determines a 
kink at a certain wave number /. Then, we define the large-scales W as the filtered 
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vorticity with modes up to /, obtaining a banded matrix. We propose three possible 
ways, both deterministic and stochastic, of closing the equations for W, by choosing 
different interactions with the small scales. Finally, we provide numerical tests to 
assess the different models introduced. 


2 Sparse-Stochastic Model Reduction 


The Euler—Zeitlin equations (2) allow to study some typical features of the 2D fluids 
in the matrix language. In this section, we propose a way to reduce the complexity 
of the Eq. (2), by defining from W a sparse matrix W which retains the relevant 
large-scale information. Then, we show different ways of closing the equations for 
W, by adding a suitable stochastic term. 

In the Zeitlin model, the basis element 775, of su(N) has non-zero entries only 
in the lower and upper +m diagonal. If we look at the anti-diagonals, instead, we 
are looking at the components determining the value of the vorticity field at certain 
latitude bandwidth on SŽ, as shown in Fig. 1. 

The large scales are typically chosen to be the modes such that / is smaller than 
a threshold level /. In the Euler—Zeitlin model, this corresponds to considering the 
banded matrices limited in the diagonals +/ < / and then removing the components 
corresponding to / > /. The Poisson equation which defines the stream matrix P 
preserves this sparsity structure, since the basis elements 775, the eigenvectors of 
the Laplacian, are themselves sparse. However, the Lie bracket does not restrict to 
this space. Indeed at each time-step we have to project the vector field into the right 
space. 

Usually, we do not have any chance to guess the contribution of the small scales 
to the evolution of the large ones. However, we expect that after an initial turbulent 
transition, the fluid exhibits two clearly separated spatial scales. The hint for such 
a scenario is due to several numerical simulations of the Euler—Zeitlin equations 
[3, 20]. Eventually, the energy profile reaches a fixed configuration with two slopes. 
The first part of the spectrum represents the distribution of energy at large scales, 
whereas the second part the distribution of energy at small scales. Typically, the 
separation between large and small scales occurs at a wave number 1 ~ ~N. 


NORTH,m=0 ... — ... In = N-1 
: . : 7 : 
Ma i Uc Sy Ms Í 
: "a c" : 
[n = N-1 e. — ... SOUTH,m=0 


Fig. 1 Structure of the discrete vorticity W in the Zeitlin model 
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Fig. 2 Filtering of large-scale and definition of random small scale vorticity W, via the indepen- 
dent Brownian motions £^ 


For wave numbers larger than / the energy spectrum has the characteristic slope 
of 1-1, which is the one of white noise, see Fig. 3. The universal nature of the 
small scales suggests a model reduction in terms of large-scale evolution combined 
with a stochastic term contribution. In Fig. 2, we show the procedure to get the two 
new fields W and W. To define W, we introduce the orthogonal projection z onto 
the modes / < I. The small-scale field W is defined as the linear combination of 
the basis elements Tj, for | > I with coefficients p as independent Brownian 
motions, with mean and variance obtained from the high-resolution DNS. This 
choice of coefficients ensures that the random field yields the same mean energy 
spectrum at the small scales as measured from the DNS. Additionally, applying 
the Kolmogorov-Smirnov and Anderson-Darling tests for normality to the high- 
resolution data suggests that the distribution of the basis coefficients for Tim, for 
l> l, is Gaussian. 

Hence, we define W := 2W and W := Pe XE as p" Tim. With these 
new fields, we essentially have three possible choices. The first one consists of a 
deterministic closure simply via the projection of the vector field onto the large 
scales: 


(3) 
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which we refer to as the no-model closure. The second model is the large scale 
enstrophy-preserving stochastic closure, which is up to the projection z a type of 
SALT equation (see [15]): 


dW = z|[P, Widt + E 141 X nsi SES p” Utm. wW] o dpim 
Ax P = W. 


(4) 


We recall that the symbol o denotes the Stratonovich integral and indeed we find 
that the large scale enstrophy is conserved, via 


45TQW^) = Tr(WdW) = Tr(Wz[P, W])dt 


N-1 I 
1 = EE 
—  — — TrOn Ti. WI) o dg" = 0 
* 2. à. asi (Wr |Tim, W]) o df 
=/4+] M= 


being 7 W = W and [W, W] = 0. Notice that the other large-scale Casimirs are not 


preserved, since in general nW“ # (nW)*, fork > 1. 

Finally, the third model is a large-scale energy-preserving stochastic closure (see 
[11] for its analysis and [16, 8] for more recent applications). We note that [16] 
introduces this stochastic closure as Stochastic Forcing by Lie Transport (SFLT) and 
provides its general definition. Here, we refer to this closure as energy-preserving 
noise (EPN) and adopt the following definition: 
dW = z[P, W] Jat + Y enn Yt oam P, Tim] o api " 

AnP=W. 

The proof of conservation of the large scale energy for (5) is identical to the one 
of large scale enstrophy conservation by noticing that x commutes with Ay. The 
benefit of the stochastic models (4) and (5) compared to the no-model closure is 
that the stochastic models provide an explicit representation of the small-scale flow 
features and, doing so, aim to truthfully affect the evolution of the large scales of 
motion. No such representation is included in the no-model closure. In the next 
section, we perform a numerical test for the three different models (3), (4), (5), 
comparing them with the high-resolution DNS. 


3 Numerical Simulations 


In this section, we carry out a numerical experiment to study the performance of the 
models proposed in the previous section. The numerical experiment is conducted as 
follows. We set the high-resolution level at N = 128. Then we generate a random 
initial condition and we run a high-resolution DNS. We stop the simulation once 
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Energy 


10 10 
Wavenumber 


Fig. 3 Initial vorticity obtained via high-resolution DNS. Top left, the field W, top right, the 
filtered field W, bottom left, W — W, bottom right, the energy spectrum of W. Note the change of 
slope in the energy profile at | ~ /N 


a stationary energy profile is reached (see Fig. 3). The solution at this point in 
time defines the initial condition of the reference solution and the ensuing model 
simulations. 

From the DNS we select the large-scale threshold as wave number / ~ /N, at 
which the kink in the energy spectrum appears. In our numerical simulation the kink 
is found to be at / = 14. 


Remark 1 The kink in the energy spectrum must depend on the truncation level 
N. Indeed, it was shown that the tail of the energy distribution at small scales for 
conservative schemes, like the Euler-Zeitlin one, has a characteristic slope of koh, 
This would imply unbounded energy for N — ov, which contradicts the fact that 
we only consider vorticities with bounded energy. Therefore, the kink wave number 
between the two slopes in the energy profile must increase with N. Numerically, we 
have observed that it grows like JN. 


Then, we define our large-scale field as W := xW, where zx denotes the 
orthogonal projection onto the modes for / < I. The projection consists of two 
steps: first we extract the components up to / and then we generate the field W. 
The cost of calculating each component is O(N) and since we need to repeat this 


operation Poi N times, the total cost of extracting the components is O(N’). 
Clearly, to construct the field W we have to perform O(N?) operations. Hence, the 
total computational cost of the projection x is O(N?). Hence, given two matrices 
A, B € su(N), the cost of evaluating zr [zt A, z B] is given by the evaluation of 
7z plus the cost of multiplying z A and 2B. Since we are interest only in the +/ 
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diagonals we need to perform O(N. 1) vector-vector multiplications of the cost oD, 


which implies a total cost of O(N Py = O(N?). The same cost O(N?) holds for the 
stochastic term, since we can actually consider only m — —l, ...,1. We also define 
W, as explained in the previous section. Finally, we define the reference solution 
as the high-resolution numerical simulation using the initial condition previously 
defined. The large-scale field obtained by projecting this initial condition serves as a 
starting point for the simulations where the small scales are modeled as described in 
Eqs. (3), (4), (5). All simulations are run for 250 time units from the initial condition. 
In our numerical simulations, the time integration is done via the Heun-type scheme 
adapted for the SDEs, with time-step h = 0.25. In the following, we perform 
an ensemble of realizations for the stochastic closures. Each ensemble consists of 
25 realizations, which is sufficient to show the qualitative difference between the 
proposed models. 

We notice from Figs. 4 and 5 that the no-model solution and the mean SALT 
solution perform well compared to the reference solution, in terms of qualitative 
vorticity evolution as well as the kinetic energy spectrum. On the contrary, the 
energy-preserving scheme loses accuracy and a cascade of energy to lower wave 
numbers occurs. We attribute the observed difference between SALT and EPN to 
two causes. Firstly, the orders of magnitude of the stochastic terms entering the 
equations differ between the methods. The stochastic forcing types are parametriza- 
tions of components of the effect of small scales on large scales, respectively given 
by z[P, W] and z[P, W]. Figure 6 shows these quantities for a high-resolution 
snapshot, which illustrates that the term parametrized by the EPN is significantly 
larger than the term that SALT parametrizes. Therefore, it is reasonable to expect 
that the proposed use of EPN leads to more substantial deviations from the no-model 


Reference No model 
0.2 0.2 
0.1 0.1 
0 0 
-0.1 -0.1 
-0.2 -0.2 
Mean SALT Mean EPN 


0.2 0.2 
0.1 0.1 
0 0 
-0.1 -0.1 
-0.2 -0.2 


Fig. 4 Evolution of the large scales over 250 time units, via the different models proposed and the 
high-resolution simulation. No model corresponds to (3), SALT to (4) and EPN to (5). The shown 
results using SALT and EPN are the mean of an ensemble each consisting of 25 independent 
realizations 
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- - Ensemble mean + one standard deviation 
— Reference 
— No model 


2 4 6 8 1011 2 4 6 8 101214 
Wavenumber Wavenumber 

Fig. 5 Energy spectra at t = 250 using SALT (left) and EPN (right), both computed from an 

ensemble of 25 independent realizations and compared to the reference and no-model energy 


spectra. Note that the standard deviation of the ensemble obtained using SALT is too small to 
discern in this figure 
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Fig. 6 Components of the evolution of the reference vorticity projected onto the large scales, 
generated from the reference vorticity field at t = 250. Note that the ranges of the color bars vary 
per field, which highlights the difference in magnitudes between the fields 


simulation than the use of SALT. In fact, the proposed use of SALT only leads 
to very small changes compared to the no-model setting. Secondly, in the energy- 
preserving scheme, no energy can leave the large scales. Hence, if the transfer of 
energy between different modes is non-zero, the conservation of the large-scale 


energy prevents the energy to flow from large scales to small scales, causing an 
extra accumulation of energy / ~ l. 
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In order to check this thesis, we compute the energy transfer among different 
modes in the high-resolution DNS. Let us consider the energy at a level /: 


E()— S 
0-3 2 (+D 


m=—I 


Then, the energy variation in time is given by 


l 
dE(D E Olm| P, Wim 
dt 2 41) ` 


Let F(l) :— [£50 be the absolute value of the energy transfer due to the 


nonlinearity of the vector field [P, W]. In Fig. 7, we plot the energy transfer 
contributions of the four possible couplings of large and small scales. We notice 
that the transfer of energy between large and small scales is non-zero. In particular, 
the main driver of the energy for the components of W is the vector field [P, W], 
whereas for small scales it is [P, Wi. 

The plots in Fig. 7 show that the term [P, w] becomes more and more relevant 
in the energy flux at large scales, while approaching the threshold level /. Therefore, 
one might expect the stochastic model (5), which takes into account this term too, 
to be more accurate than (3) or (4). However, form Figs. 4 and 5 this does not seem 
the case. Indeed, the way we model the term [P, w] in (5) prevents the energy from 
flowing from large scales to small scales. However, also in (3) there is no energy 
flow from large scales to small scales. Therefore, we suggest that at large scales 
the term [P, W] is responsible for redistributing the energy from lower to higher 
frequencies and in absence of a dissipation mechanism the rearranging of energy 
diverges from the correct spectrum. 

The fact that (3) and (4) perform equally well is quite surprising and indicates that 
the small scales do not affect the large scales much when a stationary energy profile 
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Fig. 7 Energy transfer among different modes computed from a snapshot of the high-resolution 
simulation. Left, energy transfer among modes at large scales, right, at small scales 
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is reached. However, for much longer times than those we have run, it is possible 
that the effect of small scales on large scales become more relevant and (4) becomes 
more accurate than (3) in terms of large-scale dynamics. Further investigations on 
this aspect are ongoing research and will be presented in future work. 


4 Conclusions and Outlook 


In this paper, we have presented a possible strategy to reduce the complexity of the 
Euler—Zeitlin model, while performing long-time simulations. Numerical evidence 
shows that the Euler—Zeitlin equations exhibit a clear separation of scales such that 
the large-scale dynamics are quite robust to different couplings with small scales, 
either deterministic or stochastic. Interestingly, the energy-preserving scheme we 
have defined shows that the energy at large scales cannot be exactly conserved. This 
means that large and small scales are never completely decoupled, even when the 
energy spectrum profile reaches a stationary regime. This indicates that for very 
long times a non-zero transfer of energy among different scales is present. 

The Zeitlin model has been criticized for unrealistic conservation of enstrophy 
and other Casimirs at a finite level of truncation N. Our result shows that this issue 
can be understood such that the Euler—Zeitlin equations are quite robust and precise 
in describing large scales, which means for wave numbers / ~ 4/N. On the other 
hand, the remaining modes are themselves a model for the small scales, which 
correctly mimic the energy flux among different modes. 

In conclusion, we have shown that the Zeitlin model can be a useful tool for 
simulating long-time large-scale dynamics. In future work, we aim to perform more 
systematic simulations using the parallelized code developed in [5] and available 
on https://github.com/cifanip/GLIFS. Additionally, further analysis of energy and 
enstrophy transfers between the scales of motion may serve to derive tailored data- 
driven stochastic closure models for the Euler-Zeitlin equations. 
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Franco Flandoli, Silvia Morlacchi, and Andrea Papini 


Abstract The effect of transport noise on a 2D fluid may depend on the space- 
scale of the noise. We investigate numerically the dissipation properties of very 
small-scale transport noise. As a test problem we consider the Kelvin-Helmholtz 
instability and we compare the inviscid case, the viscous one, both without noise, 
and the inviscid case perturbed by transport noise. We observe a partial similarity 
with the viscous case, namely a delay of the instability. 


1 Introduction 


Stochastic transport is a new fundamental perspective on fluid dynamics, see e.g. 
[29, 39, 13, 8] and [22]. A transport type noise in a fluid dynamic model may be 
seen, loosely speaking, as a simplified description of small-medium space-scales of 
motion. In the numerical simulations (see for instance [8]) we may observe the way 
it perturbs large scale motion; in general, this perturbation destabilizes large scales 
producing smaller eddies. 

This has been observed for homogeneous noise where small scale perturbations 
with possibly local backscattering of energy are observed. While there are cases in 
which transport noise has been shown also to trigger secondary circulation at large 
scales for inhomogeneous noises [3]. 

In this note we want to explore how transport noise may affect large scales, some- 
what opposite to the one mentioned above. The key difference is the assumption 
that it is very-small-space-scale. The noise used below is made of very small, low 
intensity, vortex structures. In such a case it may happen that the transport noise acts 
as a dissipation, an additional viscosity. 

It corresponds to Joseph Boussinesq intuition [6] that “turbulent small scales 
may be dissipative on the mean flow”. The physical intuition, beyond the specific 
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mathematical derivation, is that fluid particles move so erratically to produce effects 
similar to the molecular motion. In a sense, it is like a macroscopic version of the 
molecular dynamics: as macroscopic kinetic energy transfers to molecular kinetic 
energy (heat), reducing the macroscopic motion, similarly large-scale kinetic energy 
moves to small scale turbulence and reduce the intensity of the mean flow. There 
have been many attempts to prove the validity of this picture, see for instance [5, 16, 
21, 32, 30, 44], but a final conclusion is not clear, also because the empirical validity 
is sometimes moderate [32, 41]; in particular, the inverse cascade in 2D seems to 
break the viscosity effect due to small-scale turbulence [14]. 

However, this idea is certainly very useful for numerical simulations of fluid 
dynamics model, being the basis of the LES method [5], and, in case of vorticity 
equations, the vortex blob method [11]. 

Theoretically, this kind of turbulent-small-scale transport noise has been investi- 
gated in rigorous works. This line of research was initiated in [23] and developed in 
several works, see e.g [16, 19] and many others (see [22]). 

However, its stabilizing power has not been tested numerically yet. Here we 
observe its action on one of the strongest and most common instabilities: the 
Kelvin-Helmholtz one. This is an instability that occurs when a velocity difference 
across the interface between two fluids is present, producing macroscopic vortex- 
like structures. For a description of Kelvin-Helmholtz instability see [31] or [36], 
Chapter 6, and our description below; as a typical instability of shear flows, see 
also [14]. 

We compare the inviscid case, the viscous one, both without transport noise, and 
the inviscid case perturbed by transport noise. The results are described in Sect. 4 
. Since we approximate the fluid dynamic equations by vortex methods, which fits 
quite well and in a unified way with the three cases analyzed here (inviscid, viscous 
and stochastic transport), we describe some preliminaries on this topic in Sects. 2 
and 3. 


2 Model Formulation 


We consider the two-dimensional flow of an incompressible fluid on a 2D domain, 
either the full plane or T?, the 2D torus with periodic boundary conditions. 

As usual, the equations for the motion of the fluid are the conservation of mass 
and linear momentum, see e.g. [2], expressed through the Navier-Stokes equations 
in the null-divergence formulation. The main interest for our numerical simulations 
is the equation for the evolution of vorticity w(t, x), which can be derived from the 
Navier-Stokes equations: 


Ojo c -u- Vo = v ^o. (1) 


Here v is the kinematic viscosity, and u is the velocity of the fluid solving the NS- 
equation. In the 2D case, we can express w :— V x u as the curl of the velocity 
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field, a vector normal to the flow plane. We denote x = (x1, x2) to be any point in 
the domain. 


2.1 Point Vortex Method for Inviscid Flows 


In order to state the problem we investigate numerically, we first recall several 
well-known facts; for a general introduction, see e.g. [36, 37, 43]. For the sake of 
simplicity, we focus on an inviscid fluid; the vorticity equation (1) reduces to, 


Qj -Fu - Vo = 0, (2) 

From the null-divergence hypothesis, if u has zero average we can define a stream 

function associated to the fluid w(t, x), see [2], such that the velocity u is given by 
u = — Vy. We obtain the stream function by solving the Poisson equation 

Ay = —o. (3) 

To compute the solution, we need to express the Green’s function and the convolu- 


tion with the variations of constants methods, which for flows in the full plane R? 
reads as 


vex»= f Ui yat diet kd (4) 


where G is the Green's function, the fundamental solution of the Laplace equation. 
The explicit form of G in the full plane R? is 


1 
Gay) = zz logie = yD). (5) 


Using (4) we obtain the velocity field 


u= f K(x — y) w(t, y) dy := K «a, (6) 
where K is given by 
K(x) 2 V+G(x). (7) 


Equations (6) and (7) for the velocity are known as the Biot-Savart law, and K is the 
Biot-Savart Kernel. Note that we must correct this velocity for flows over bounded 
or periodic boundaries domains to satisfy the boundary condition (See Sect. 4.1, for 
details). Therefore, we have to change the Green’s function according to the Poisson 
equation. 
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Consider now a fluid particle X;, moving in the velocity field; from (2), the path 
of the particle is 


Ed (t, X;) 
LU ah. 
dt! t 


w(t, X1) = wo(Xo). 


Therefore, considering the fluid as distinct “fluid particles” of constant vorticity, 
these abstract objects’ motion determines the scalar field’s evolution. This is the 
premise of the point vortex method to compute (2), (see [36, 37] and [43] for details). 

To this end, consider N point vortices, idealizing a 2D inviscid fluid and occu- 
pying positions X 2 es X N , with intensities (circulations) I, ..., Ty respectively. 
They move accordingly to the following set of ordinary differential equations 
derived from the previous computations (8): 


dx! |o vj 
A = Y rk (xi. x/) (8) 
jzi 

1 @-y)t 
27 yr 
in full space, suitably modified on a torus or in a bounded domain (see Sect. 4.1). 
We refer to [15] for well-posedness of the ODE system for almost all initial 
configurations (including the case when the system is perturbed by additive noise). 
One can prove that the empirical measure 


where the vector-valued kernel K (x, y) is the Biot-Savart kernel, equal to 


N 
æ (t,-):— » V6 yi 
i-l 


is a weak solution of 2D Euler equations in vorticity form (suitably interpreted 
for distributional fields as in [42]), see [24] and [26, 27] for a discussion of well- 
posedness in this context): 


Qj -F-u - Vo — 0 


wx = [ KG yet yay (9) 


@|1=0 = 0 


with wo := 3g Djó xi: Here o is the (scalar) vorticity, u is the (vector) velocity. 
Moreover, consider the empirical measure parametrized by N and rescaled by 
1/N: 


N T; 
ont, )= >> xxi 


i=l 
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In [36] a convergence result to Euler equations was proved under some restrictions 
(specifically, in that case X : solves a vortex dynamics with mollified Biot-Savart 
Kernel, with vanishing mollification as N — oo, with rates of converge of the 
parameters a bit far from optimal). The best result in the literature of convergence 
to Euler equations was then [42], see also other references therein. 

Notice that the velocity field associated to the distributional vorticity wy (t, +) is 


N 
1 ] 
un (t, x) := A (s. xi). 
i-l 


p 


This is a well defined vector field, of class L;,. 


for every p « 2 but not for p — 2. 


2.2 Point Vortex Method for Viscous Flows 


To investigate viscous flows, following [10, 33, 38], we modifying the previous 
scheme by adding independent 2D Brownian motions wi, ints wr to the equations 
of point vortices 


dX; = Y TK (Xi, xi) dt + V2vaw;. (10) 
z 


Then, the empirical measure wy (t, -) converges weakly, in probability, to the unique 
solution of the 2D Navier-Stokes equations in vorticity form 


dw +u- Vo = v ^o 
wx [ Kod. 94» (11) 
c|;—o = «o 


We refer to [7] for convergence to 2dNS on the full plane, and to [33, 38, 25] for 
results in the case of domains with boundary. 


3 Point Vortex Method with Environmental Noise 


As mentioned above in the Introduction, several works, e.g. [29, 39, 13, 28, 9, 8] 
and [22], indicated an interest in the following stochastic modification of the Euler 
equations 
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do +u-Veodt = M or: VoodB; (12) 
keK 


where ox = ox (x) are given vector fields, that we assume divergence-free, (BF ) keK 
are independent 1D Brownian motions and the stochastic operation o stands for the 
Stratonovich integral. Due to this, formally, vorticity is conserved (it is transported 
randomly by the field u dt + Y 4. y ox o dB*). 

This model bears similarities with the viscous flows, and the objective of this paper 
is to show differences and similarities of the elliptic operator obtained from such a 
transport-advection noise. 


3.1 Transport Noise and Deterministic Scaling Limit 


The point vortex dynamics associated to the model (12) is then given by the 
following expression: 


axi = YK (xi xi)d+ Y^ (xi) o aay. (13) 
jži keK 


Notice that this is a model of common noise (also called environmental noise): the 
BM’s Bk are the same for all particles, in contrast to the model (11) where each 
particle X! was affected by an independent BM W}. See [18] for an example of 
theoretical results on this model. For models similar to this one, it has been proved 
(see e.g. [12]) that the empirical measure converges to the solution of the SPDE (12). 
Atthe same time, following [23] and subsequent works, if the noise is parameterized 
in such a way to become more and more small scale, the SPDE (12) converges to the 
deterministic equation with additional viscosity (see Sect. 3.2 for details on possible 
noise selection) 


jo +u: Vo = v Ao. (14) 


Inspired by [20], we consider a sort of mixed scaling limit: we take the point 
vortex dynamics with common noise (13), which for given fields og would converge 
to the SPDE (12), and we choose more and more small scale coefficients o; in order 
to be close to the deterministic equation (14). 

The scheme just described has two parallel interpretations. If we look at the point 
vortex dynamics just as a numerical method of discretization of Euler equation, 
the transport noise in the point vortex dynamics (13) is just like the transport 
noise in the Euler vorticity dynamics (12). Therefore the scaling limit when the 
coefficients og more concentrated is a numerical realization of the theoretical scaling 
limit investigated in [16], from the stochastic Euler equation to the deterministic 
Navier-Stokes equation and, as such, it is a form of proof of the validity of 
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Boussinesq hypothesis that small scale turbulence enhances the viscosity. That is 
the first interpretation. The second one is more related to particle systems and 
noise. Consider the particle system (13), having common noise and compare it 
with the more classical particle system with independent noise acting on each 
particle (Sect. 2.2 above). In the scaling limit of oy described above, the common 
noise becomes more and more spatially de-correlated, close to independent noise at 
each space position, and thus it acts almost independently on each particle. Since 
point vortices with independent noise converge to the Navier-Stokes equations, it 
is natural to expect that also with common noise but very concentrated o; the 
empirical measure of the particle system is close to the solution of the Navier- 
Stokes equations. This viewpoint has been investigated theoretically in [20], while 
the subject of linking the turbulence stresses to the mean flow, which is in synthesis 
the Boussinesq idea, was explored in appendix A and B of [40] and theoretically in 
[17], providing a link to eddy viscosity and an environmental small scale noise. 


3.2 A Digression on the Theoretical Selection of the Noise 


In this section, in the same spirit as [20, 22, 40], we explore some property of the 
environmental noise that we use in a simplified way in our numerical simulations. 
Following the works on modeling of passive scalars [34], when considering the 
scaling limit of (13) to w(x) solution of the viscous Euler equation (14), we consider 
a model of noise in the fluid which is delta-correlated in time, namely a white noise 
with a precise space dependence. 


W (t,x) dt = 3 ox (x) dB; (15) 
keK 


where (o, (X))x is a family of smooth divergence free vector fields on the 2D domain 
of the equation, and BF are independent one-dimensional Brownian motions; K is, 
usually, a finite index set, but with suitable assumption we could consider also the 
case of countable family of smooth fields. 

In this case, the term W (t, x) - Væ (x) obtained in the convergence result of the 
point vortex empirical measure, must be interpreted as a Stratonovich integral 


Y ok (x): Vo (x) o dB. (16) 
keK 


Assume that the solution is sufficiently smooth so that the Stratonovich integral 
makes sense, then this is given by an Itó-Stratonovich corrector plus an Itó integral; 
precisely, is given by: 
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1 
x > ox (X) - V (ox (X) - Vo (x, v)) dt + dM (t,x) 
keK 


where M (t, X) is a (local) martingale. Follows that the Itó-Stratonovich corrector 
takes the form of an elliptic operator: 


1 
-3 div (C (x, x) Væ (x)) dt 
where C (x, y) is the space-covariance function of the noise 


C(x, y) = ook (X) @ oxy). 


keK 


As an example, we take the noise [34], which is relevant to our numerical 
investigation in the choice of the divergence-free field in the point vortex model. 
For simplicity, assume the domain to be R?, but modifications on T? are possible, 
see for example [16, 20]. 

Its covariance function is space-homogeneous, i.e. C (X, y) — C (x — y), with 


the form 
1 l k&k 
C (z) = w f JF ez (1 — = ) an. 
kox|k| «x; Ik|^** Ik] 


The famous Kolmogorov 41 case follows if we take ¢ = 4/3. Taking kı = +o, 
then C (0) = Ko? where the constant K is given by 


" -f 1 (i rex) " 
1«|k| «oo Ik|^** Ik]? ) 


We consider small-scale turbulent velocity fields depending on a scaling parameter 
and taking the scaling limit in (12), as in [20, 23]. In the case of [34] we have 


ko = ki’ > oo 


The result C (0) = Kv is independent of N, so that the Itó-Stratonovich corrector 
becomes equal to 


vAo(X), 


and simultaneously, we may have that the Itó term goes to zero, hence recover- 
ing (14). 

Let us sketch the argument (see [16] for details) which explains why the Itó term 
may go to Zero, in spite of the convergence to a finite non-zero limit of the Itó- 
Stratonovich corrector. Let ¢ be a smooth test function. One has 
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T 2 T 
z pi eis von dod) = [f m2 
keK keK 


by the isometry formula of Itó integrals, 


T 
= spam lon bd 
kek”? 


since div og = 0, 


T 
= | Í [ [Xo vea d 


keK 


T 
= | f J [vet ee»: ve (x) v (t, x) o (t, »dsdyat| 
0 


T 
= uf (CO;, 0;) p» dt 
0 


where is the linear operator on vector fields with kernel C (x, y) and 06; (x) = 
Vo (x) o (t, x), 


T 
n] 2 
< (Ole nf l8; lz ar. 
0 


Now, one can prove uniform bounds on E i. IlO; I2; dt with respect to the scaling 
of the noise and one can choose a noise such that ||C||j» , 2 goes to zero. Notice 
that in the Itó-Stratonovich corrector only the diagonal C (x, x) counts, while the 
smallness of ||C||;». , z2 is related to the smallness of C (x, y) when x Æ y. 


4 Numerical Results 


4.1 Setting: Kelvin-Helmholtz Instability 


In this section, we investigate classical results on the shear flow model in the setting 
of point vortices, analyzing the Kelvin-Helmholtz instability and the possibility of 
delaying the structure formation. In this way, we can both test the goodness of our 
point vortex models and, at the same time, set a benchmark for which we will show 
delayed instability. In order to test the point vortex model in the classical cases (8) 
and (10), we choose the particular fluid configuration of a shear flow because of its 
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fundamental property: developing instability without viscosity and delaying it when 
viscosity is present [36]. 

We work on the torus T? equal to the set [—1, 1]? / ~ with coordinates x = 
(x1, x2) and identified boundaries at x1, x? = +1; all fields are periodic in the x1 
and x2-direction. We take an initial velocity uo of the form 


0 0 
u? (x1, x2) = (uf (x2) .0) 
and corresponding vorticity wp = ðu? (x2). We choose, in particular, the function 


-lif x2 <-—ô 
ul (x2) = 4 € if—8 <x) <6 (17) 
lif <x 


where we fix 6 = 0.02 in our numerical simulations. 

To compute the vorticity measure of our point vortices, we use vortex blobs, 
obtained by spreading the circulation of a point vortex over a chosen small area, the 
vortex core (see e.g. [43]). In this formulation, the vorticity field is approximated by 


oN (x,t) = M Ti dex — XD, (18) 


where the mollifier ¢, (in our numerical simulations a Gaussian kernel with width 
dependent on the subscript e, the characteristic size of the vortex core) describes the 
vorticity distribution in the vortex core. Following standard numerical techniques 
(see e.g. [4, 1]), the core size € of the vortices has to be much larger than the average 
spacing d between the vortices; the core size is usually taken to be € = d^, with 
q <<. 

In (18), the vorticity distribution at any time depends on the point vortices x! 
through the vortex blobs. In our numerical simulations, we take N ~ 10^ point 
vortices; following a mean-field approach, the initial circulation for every given 
point vortex is derived from u? and is equal to ri = a We solved the point vortex 
model (8) using a 2nd order Runge Kutta scheme coupled with a Heun techniques 
for the noise, for a second-order time discrete approximation; the time step for our 
simulations was selected to be At ~ 10^? to ensure a trade-off between the stability 
of our method and the generation of vortex-like structures in the shear flow model. 

In the usual way, we also recall that in our numerical framework, the kernel 
K in (13) corresponds to the Biot-Savart kernel. We have that K = vig = 
(d2G, —81G), where G is the Green function on T?. In the whole plane we have 
the simple expression Gg» = x- log |x|; while for our domain we know that 


G(x) = 5 log |x| + s(x), Vx € T? \ {0}, (19) 
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and s(x) is a smooth function on T?. Thus, K is divergence-free, smooth away from 
the origin, and symmetrical; moreover, it holds the following behaviour: 


|K(x)| ~ 1/|x|, as |x| > 0, 


1 G-» 

Wr pep 
throughout the numerical simulations. Without ambiguity, from here on, we con- 
sider the horizontal and vertical axes as our reference frame, naming them the x-axis 


and y-axis, as usual. 


which we extensively use to approximate our kernel with Kp2(x — y) = 


4.1.4 The Role of Intrinsic Instability 

We know that, at least formally, the vector field u = u? is a solution of Euler 
equation (14) with v = 0. This system is unstable: small perturbations rapidly 
develop vortex blobs. As such, we consider the system of point vortices (X7); with 
initial vorticity, derived from (17), and expressed as 


1 1 
aon, x2) = uc 2. 55x Œ x2), (20) 


where the circulation for each point is equal to us and the initial positions of the 
vortices Xi Vi = 1,..., N are uniformly distributed on the strip [—1, 1] x [—4, ô]. 
The randomly generated initial condition represents small perturbations in the 
system and is fesponsle for the different pattern formation. 

The measure aN LE wr ô xi converges, in distribution, to the scalar vorticity 
field solution of the Euler equation (14); analogously with the continuous case, we 
see in Fig. la, c the development of instability in the form of macroscopic vortex- 
like structure on the boundary of the two fluid layers. 

Note that the number of such macroscopic vortex-like structures and their 
position is entirely dependent on the initial condition: small perturbations on 
the randomly generated point vortices can produce entirely different macroscopic 
vortex-like structures, hence the instability of the two laminar fluids’ profile. 


4.1.2 The Role of Viscosity and Stability Restoration 


The exact solution of the Navier-Stokes equation (14), with v > O and initial 
condition u?, is given by 


u (t, x1, x2) = (uy (t, x2),0), 


where u1 (t, x2) solves the heat equation, 
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ðu = vð? ui (21) 


u1 (0, x2) = u? (x2). 


Due to the spreading of the profile uo, the solution becomes more stable; namely, 
the development of macroscopic vortex-like structures is delayed. In our numerical 
simulations, we reproduce this phenomenon by perturbing the system (8) through 
independent Brownian motions Bi, i = l,..,N, with variance linked to the 
viscosity parameter: Var(B]) ~ 4/v. This system converges to the exact solution 
when N — oo; however, in our numerical study we are dealing with a finite system. 
For this reason, the profile of the strip remains quite stable for short times, with just a 
spread along the y-axes. We report in Fig. 2a, b a single configuration at two different 


Fig. 1 v = 0. (a): initial 3 
configuration, approximating 
a shear flow fluid dynamics; E 


(b): iteration t = 50, 

formation of macroscopic > 
vortex structures; (€): 

iteration t = 100, perfectly 3 
developed macroscopic 
vortex structures 


00 


-0.2 


02 
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timesteps, t = 50 and t = 100; we take „/⁄v = 0.095 to better focus on the stability 
restoration. Our results are in agreement with the theory: from a comparison with 
Fig. 1b, c, we see that when v > 0, the profile is much more stable and diffused 
than in the deterministic case, and macroscopic vortex-like structures appear only at 
large times. 


4.2 Numerical Results on Environmental Noise 
4.2.1 Selection of Divergence Free Field 


Starting with the same initial condition (17), we consider N + M point vortices, 
each of which we associate with a position in T? in the following way: 


X d Plu 


Here, the vortices YÍ, i = 1,..., M do not move, and when their interactions with 
point vortices became not negligible, they represent the feedback of small-scale 
turbulence acting on the fluid itself on large scales. 

The new simulated vortex dynamics for X : in T? as in (13), reads 


Fig. 2 v > O. (a): iteration 34 
t = 50, preservation of strip 
profile; (b): iteration t = 100, 34 


first development of the strip 
profile’s instability, formation 
of large rotating structures 
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dX! = 2e (xi - X, are Soup odW/ 
l l E, 


with periodic boundary conditions, where W/ are Brownian motions, all indepen- 
dent and uni-dimensional, and they are acting simultaneously on all the particles i = 
1, ..., N. The environmental noise follows the Stratonovich integral prescription, 
automatically implemented in Heun's method [35]. 

We choose the divergence-free vector fields ø; as 


o; (Xl) c a" K (xi z y’) , j=l, M 


following the theoretical analysis performed in [22, 20]. Here, the intensities aM 
are linked to the scaling limit process, which produces a viscosity term on the 
large scales, and K, the Biot-Savart kernel, simulates the action of such small 
vortices. The idea behind such a selection is that we want to exploit the same 
features of the vortex model, with the formation of small-scale vortex structures 
generating feedback on the entire configuration. In the limit, the dynamics of such 
small structures, modulated through a Brownian motion, rebound on large scales, 
perturbing their motion with their dissipative properties and delaying the formation 
of the instability. 


4.2.2 Positions and Intensities of Fixed Vortices 


In order to connect with previous studies [20], we choose the positions of the fixed 
vortices Y/ and their intensity a j according to the convergence of the scaling 


limit (4.2.1). More precisely, at each timestep, we generate Y/, j = 1,...,M, 
uniformly distributed point vortices; their position on the y-axis is apriori selected 
in the interval [—óp x, óp x]. In this setup, the vortices Y J are generated in a strip of 
variable height 20 p y ; this strip contains the moving vortices X y , and it is taken to be 
of the same height of our boundary fluid layers, or one order of magnitude greater. 
This choice emphasizes that our proposed “small-scale” structures should act on 
all points of the fluid in all directions: the average contribution of the Y/ on the 
X : along every direction should mimic a Brownian motion. We explored different 
setups of positions and intensity; we selected meaningful realizations, as reported 
in Table 1. 


Table 1 Parameters of the M 


: Fede ópx m a 
discussed realizations 


200000 |0.1 | 0.0014 | 0.0005 
132000 | 0.07 | 0.0014 | 0.0005 
1000 | 0.07 | 0.0017 | 0.005 
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We choose the intensity of the “small-scale” perturbations following heuristic 
considerations. We consider the mean inter-particle distance between two fixed 
point vortices, < r >:= ./1/m, where m := M/A is the particle density and A 
is the total area occupied by the M vortices. 

Then, we focus on a single moving vortex, X 5 we compute the magnitude of its 
velocity when X i is at a distance d =< r > /2 from the nearest fixed vortex, i.e., its 
position is halfway between two fixed vortices. As a result, we obtain the following 
estimate for the velocity of X: 


NM. og 
J 


ya 1 xj- Yit 1 ys 1 
F J An |Xxi — YJ|? 4r ; d 


a 
EM 


j 


where we suppose that the coefficients depend on the configuration (X Di, and are 


equal for each j = 1, ..., M. What is left is to estimate the number of fixed vortices 

such that the interaction with X; is not negligible: let us call this number K, giving 
~ Ka" 

u 7 


Thus, using the theory from the scaling limit of environmental transport noise 
(see e.g. [22, 20, 23]) and the construction of Sect. 3.2 for point vortices with 
transport noise, we assume that 


It remains to estimate K, the number of the nearest fixed vortices: consider a ball 
centered in X; with radius d, so that the area is Anear = z d?. We recall that m is 
the density of the fixed vortices, then the nearest vortices are: 


Taking into account only the nearest vortices, we are underestimating the actual 
contribution of all the vortices. In particular, we should compute such contribution 
by considering a radius dependent on the effective range of the image of the Biot- 
Savart kernel. In fact, being K ~ ee , contribution for distant particle is negligible. 
For this reason, we empirically selected a wider radius ad, with a ~ 3. Concluding 
we get our estimates for the intensity 
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4.2.3 Effect of Small Scale Common Noise 


As recalled in the previous paragraph's heuristics, the procedure of the scaling limit 
is preserved when both N and M are large, and the intensity p is small. For this 
reason, we do not search for the same exact solution of the Navier-Stokes equation 
(14), v > 0, with initial condition u9, as per the case of the independent noise. 
However, since the regime tends, in the limit, to the same solution, we expect a 
diffusive effect on the strip of the point vortex. More precisely, we expect to see a 
delay in the formation of macroscopic structures and a more dispersed displacement 
of such small vortex blobs that, on average, should maintain the strip configuration 
for a longer time. 

In the first of our simulations, we generate at each time step M ~ 2 - 10? fixed 
vortices, with intensity p ~ 5. 107^ which follows from the heuristics. The 
fixed vortices are uniformly distributed in a strip [—1, 1] x [—0.1, 0.1], containing 
the initial point vortices configuration. This particular setup captures the feedback 
effect of small scales on large vorticity structures, as the contribution of all the 
low-intensity perturbations on the dynamics of the point vortices averages in every 
direction. In the proposed numerical simulation, we show that the transport noise 
model reproduces the desired instability delay, even if it is slightly less effective 
than in the independent noise case; we illustrate a snapshot of a configuration for 
time f — 50 and t — 100 in Fig.3a, b. 

By comparison with Figs. 1b and 2a, we see that the initial strip configuration is 
preserved for a longer time than in the deterministic case and rotation of the fluid 
is milder, but the profile is less stable than in the viscous regime. If we focus on 
the deterministic case, we see blob-like structures formation already at t = 50; in 
contrast, in the transport noise regime, those structures are less visible and appear 
more prominently only at the end of our simulation (t = 100). This delay of the 
instability is evident in the realizations in Fig. 3b, compared with Figs. 1c and 2b: 
we notice a more diffused and homogeneous profile and a delayed formation of 
rotational structures due to the noise spreading the particles along the y-axis. A 
difference with the viscous case is that the compression in the x-axis is stronger than 
in the case of the independent noise, resulting in a more prominent stretch, which 
could resemble more the deterministic formations, placing the transport noise as a 
midpoint between the two regimes. 

In the second of our simulations, the strip of fixed points Y is generated in the 
same region as the point vortices at each timestep; we selected M ~ 1.32 - 10°, the 
fixed vortices are uniformly distributed in [—1, 1] x [—6 — e, ô + €], with e = 0.03, 
and their intensity is derived from the heuristics a ~ 5. 107+. The results are 
shown in Fig. 4: diffusion on the y-axis is present for short times, and preservation 
of the strip profile is guaranteed. However, the drawback of such a configuration 
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Fig. 3 Environmental noise. 
(a): iteration t = 50, diffusive 
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Fig. 4 Environmental noise 81 
case, iteration t = 50, 

diffusion of the strip is EE 
present for a short time with 

preserved configuration 81 
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is that analysis can be performed only for a short time: boundary effects of the 
fixed vortex strip can deteriorate the configuration, making the results unrealistic. In 
future works, we expect to overcome this obstacle by proposing a new method, now 
in the study, to generate small vortices only in regions activated by the shear flow’s 
movement. 

We performed a final simulation, in which we take the density of fixed vortices to 
be smaller than the density of the point vortices. In particular, the strip of fixed points 
Y/ is generated in the same region as the point vortices at each timestep; we selected 
M ~ 10°, the fixed vortices are uniformly distributed in [—1, 1] x [—-8 — e, 6 + €], 
with e = 0.05, and their intensity derived from the heuristics aM n 5.1073. 
While the diffusive behaviour is lost, as shown in Fig. 5, the strip is already broken at 
time t = 50, showing rotating structures. The fixed vortices’ low density and higher 
intensity seem to produce new formations and medium-scale structures, showing 
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Fig. 5 Environmental noise 
case, iteration £ = 50, low 
density ratio between fixed 
vortices and point vortices 
showing emergence of 
medium-scale structures 


01 
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a completely different behaviour than the deterministic and viscose counterparts. 
For this reason, we need to investigate further the link between the ratio of vortices 
densities and the formation of new independent medium structures. 


4.3 Diagnostics 


In this section, we perform statistic analysis on the three configurations proposed 
in this study to highlight the differences and the reconstructed stability, or the 
emergence of new structures, in the Kelvin-Helmholtz instability problem. 

As a first step, we compute the vorticity c obtained through the same mollifi- 
cation as in Majda [4], through the vortex blob method applied to each of the point 
vortices Xİ. We report our results for the vorticity computed in the deterministic 
case at f = 100 in Fig. 6a. We see that the vorticity measure concentration near the 
fluid's boundary layer is located in the newly developed macroscopic structures. 
Moreover, a displacement from the initial configuration where the laminar fluid 
started its evolution is present. 

In contrast, the vortex blob solution with viscosity v > O retains its structure for 
longer times than in the inviscid case. The instability delay is graphically evident 
both from the configuration reported in Fig. 2a and the vorticity intensity reported 
in Fig. 6b: the vorticity measure concentration at time f = 100 is similar to the one 
of the initial strip but with a more diffused profile on the horizontal line. 

From configuration Fig.3b, finally, in Fig. 6c, we show the vorticity in the 
environmental noise regime at ? = 100. In contrast with the inviscid case, we see no 
development of macroscopic structures in the profile. However, even though a more 
diffused profile, with lower density overall, is present, the stability of the strip profile 
is lost at larger times compared to Figs. 6b and 2b. This instability at larger times 
suggests that different behavior, dependent on the density of fixed vortex Y/ and 
selection of transport noise fields o;, could arise in applying this kind of small-scale 
approximation. For this reason, we focus our analysis on small-time behaviour. 

Concerning the formation of large rotating structures, we see that the particles 
spread in the horizontal direction when a forcing term, either an independent or 
transport noise, acts on the fluid, in contrast to the solution of the Euler equation 


Effect of Transport Noise on Kelvin—Helmholtz Instability 47 


(c) 


Fig. 6 Vorticity we att = 100 in the case (a) inviscid, (b) viscous and (c) transport noise, showing 
macroscopic structures formation or delay of instability 
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Fig. 7 Histograms of x-positions and empirical density at ¢ = 100 in the case (a) inviscid, (b) 
viscous and (c) transport noise, showing formation of macroscopic structures 


with v — O. In particular, we focus on the empirical density obtained from the x- 
axis in the three configurations at t = 100, Fig. 7. The deterministic case of Fig. 7a 
shows a complete formation of separate blobs with peaks in the exact locations of 
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the macroscopic structures, as in Fig. 1c. On the contrary, in the viscous Fig. 7b and 
transport noise case of Fig. 7c, the distribution of the vortices is more uniform, and 
it delays the instability of the fluid layers. 

Following theoretical results, we know that with our initial condition wo, the 
velocity u; solves (21) when v > 0. This is crucial to understanding our system's 
short-time behaviour and the preservation of the initial configuration. This result 
states that the empirical density obtained from the y-position, when viscosity is 
present, maintains a Gaussian profile through time. In particular, in the case of v — 
0, the viscosity follows a classic Euler equation. As such, the y-position profile is 
far from a Gaussian: it behaves like a multi-modal distribution, concentrated in the 
proximity of the center of the large structures. This result is supported both by the 
profile of the particle system and by Figs. 8a, and 9a; the qq-plot shows a distinct 
behaviour for small quantiles. The Kolmogorv-Smirnov test estimates a D-statistic 
of 0.031, with a p-value less than 107° confirming the rejection of the Gaussianity 
hypothesis. 

Vice versa, when viscosity is present, i.e. v > 0, as in the case of independent 
Brownian motions, the noise's diffusive behaviour allows the profile's restoration 
in the y-direction: the strip configuration is preserved for a longer time. From the 
profile of the point vortex system and Figs. 8b, and 9b, we see that the empirical 
density approximates well the one of a Gaussian kernel. Moreover, the qq-plot 
suggests a perfect match with a normal distribution, suggesting the preservation of 
the strip through time, trading it with more spread on the y-axis. Finally, performing 
a Kolmogorv-Smirnov test, we see, in fact, a D-statistic of 0.005 and a p-value 
greater than 0.9 suggesting to accept the normality hypothesis. This behaviour is 


(c) 


Fig. 8 Histograms of y-positions and empirical density at t = 50 in the case (a) inviscid, (b) 
viscous and (c) transport noise, showing different density profiles 
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Fig.9 qq-plot of the empirical density in the y-positions, at £ = 50 in the case (a) inviscid, (b) 
viscous and (c) transport noise 


preserved throughout the simulation, degrading only at longer times when a few 
large formations start to rise, as in Fig. 2b. 

In the transport noise case, we know that we recover the same viscous Euler 
equation solved in the independent Brownian motion case by applying a particular 
scaling limit procedure. To this end, we expected that the more stable profile shown 
in Fig. 3a presents the same diffusion on the density of the y-position as the case of 
the independent noise. This is the case as reported in Figs. 8c, and 9c: the behaviour 
at a short times, t = 50, in which the profile still approximates a Gaussian kernel. 
We perform a qq-plot and a Kolmogorov-Smirnov test on the y-position; we obtain a 
D-statistic of 0.011, and a p-value of 0.312; those results suggest the profile stability 
and the validity of our hypothesis. 

However, later on (t = 100), even though the quantity obtained from the KS 
test is still preserved as in the viscous case, with a D-statistics of 0.008 and a p- 
value of 0.48, the profile degrades as shown in Fig. 3b. Those results show that the 
strip configuration is not preserved for longer times due to transport noise stretching 
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acting on the point vortices. This can be seen in the tail of the distribution in Fig. 9c, 
compared to the viscous case in Fig. 9b, which shows at t = 50 already a different 
behaviour. This suggests that the environmental noise's effect is not only related to 
the diffusivity of the strip, but is also responsible for the stretching and formation of 
different structures. 


5 Concluding Remarks 


In the present work, we have produced numerical simulations of 2D incompressible 
fluids, also perturbed by transport noise, using the point vortex method. We focused 
on the special case of shear flow formation that produced a Kelvin-Helmholtz 
instability, in order to test the dissipativity properties of small-space-scale transport 
noise. We confronted the intrinsic instability generated in the deterministic case with 
the possible recovery of the stability through injected noise in the system in the form 
of transport noise. We showed that, for short times, with a degree less intense than 
the viscous case, we can maintain the stability of the strip at the expense of a more 
small-scale irregularity and diffusion of the profile. 
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On the 3D Navier-Stokes Equations with A 
Stochastic Lie Transport ems | 


Daniel Goodair and Dan Crisan 


Abstract We prove the existence and uniqueness of maximal solutions to the 3D 
SALT (Stochastic Advection by Lie Transport) Navier-Stokes Equation in velocity 
and vorticity form, on the torus and the bounded domain respectively. In particular 
we demonstrate the efficacy of Goodair et al. (Existence and Uniqueness of Maximal 
Solutions to SPDEs with Applications to Viscous Fluid Equations, 2023. Stochastics 
and Partial Differential Equations: Analysis and Computations, pp. 1-64) in showing 
the well-posedness for both the velocity and vorticity form of the equation, as well as 
obtaining the first analytically strong existence result for a fluid equation perturbed 
by Lie transport noise on a bounded domain. 


1 Introduction 


The theoretical analysis of Stochastic Navier-Stokes Equations dates back to the 
work of Bensoussan and Temam [4] in 1973, where the problem of existence 
of solutions is addressed in the presence of a random forcing term. The well- 
posedness question for additive and multiplicative noise has since seen significant 
developments, for example through the works [1, 7, 24, 29, 39, 42, 43] and 
references therein. The interest in this problem has expanded into analytical 
properties of these solutions, particularly along the lines of ergodicity, which can 
be seen in [17, 18, 22, 26, 27, 34]. In the present work our concern is the Navier- 
Stokes Equations with Stochastic Lie Transport, derived through the principle of 
Stochastic Advection by Lie Transport (SALT) introduced in [35]. We consider the 
equation 


t t t 
Ur -u+ f Lu ,Us ds -v f Ausds + | Bus odW, + Vo; = 0 (1) 
0 0 0 


D. Goodair (b4) - D. Crisan 
Imperial College London, London, UK 
e-mail: djg116@ic.ac.uk 


© The Author(s) 2024 53 
B. Chapron et al. (eds.), Stochastic Transport in Upper Ocean Dynamics II, 
Mathematics of Planet Earth 11, https://doi.org/10.1007/978-3-031-40094-0 4 


54 D. Goodair and D. Crisan 


where u represents the fluid velocity, p the pressure,! W is a Cylindrical Brownian 
Motion, £ represents the nonlinear term and B is a first order differential operator 
(the SALT Operator) formally addressed in Sect.2.3. Intrinsic to this stochastic 
methodology is that B is defined relative to a collection of functions (&;) which 
physically represent spatial correlations. These (&;) can be determined at coarse- 
grain resolutions from finely resolved numerical simulations, and mathematically 
are derived as eigenvectors of a velocity-velocity correlation matrix (see [10, 11, 
12]). We pose the equation (1) in 3 dimensions and impose the divergence free 
constraint on u. We shall consider the problem both over the torus T? and a smooth 
bounded domain C IR?. In the case of the torus we supplement the equation 
with the zero-average condition (as is classical), whilst for the bounded domain we 
impose the boundary condition 


u-n-0, w=0 (2) 


where n represents the outwards unit normal at the boundary, and w the fluid 
vorticity. These are the so called Lions boundary conditions, considered in [38] and 
shown to be a particular case of the Navier boundary conditions in [36] (note that 
this is done in 2D, whilst a treatment of the Navier boundary conditions in 3D can be 
found in [28]). The significance of such a boundary condition is well documented 
in that work by Kelliher, and can be seen in other works such as [28] where the 
boundary layer is explicitly addressed. The precise mathematical interpretation of 
these conditions, and the operators of (1), are explicated in Sect. 2.2. A complete 
derivation of this equation can be found in [45]. 

This work continues the theoretical development of fluid models perturbed by a 
transport type noise, the significance of which was posed as early as 1992 in the 
paper [6]. The area has garnered substantial attention in more recent years with the 
seminal works [35, 41], in which the authors establish a new class of stochastic 
equations driven by transport type noise which serve as fluid dynamics models by 
adding uncertainty in the transport of fluid parcels to reflect the unresolved scales. 
This paper partners that of [32] where we showed the existence and uniqueness 
of maximal solutions to Stochastic Partial Differential Equations satisfying an 
abstract framework, built to cope with a general transport type noise as we see in 
(1). The importance of such equations in modelling, numerical schemes and data 
assimilation is reviewed there, along with the theoretical developments of these 
equations: let us briefly mention some interesting results [3, 5, 21, 23]. We only 
draw particular attention here to the Navier-Stokes Equations, and results on a 
bounded domain. The Navier-Stokes Equations have been studied with transport 
type noise, for example in the works [14, 19, 25, 43], though typically solutions are 
analytically weak and where strong solutions are considered major concessions in 
the noise are made. In these cases a cancellation property is evident in the noise 


' The pressure term is a semimartingale, and an explicit form for the SALT Euler Equation is given 
in [45] Subsection 3.3. 
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term, so the resulting energy balance is formally the same as for the deterministic 
equation without noise. These difficulties have been addressed on the torus, in the 
likes of the papers [13, 37] and those further addressed in [32], but extending a 
control of this noise term to a bounded domain remains open. Furthermore, whilst 
the presence of viscosity does improve the solution theory, it invites additional 
challenges in controlling the noise. Energy methods require non-standard Sobolev 
inner products to conduct the required integration by parts for the viscous term in the 
bounded domain, so we must provide novel estimates on the transport type noise in 
these inner products. The problem of analytically strong solutions to fluid equations 
perturbed by a transport type noise in the bounded domain has been considered in 
[8], though the authors assume that the gradient dependency is of a small enough 
size to be directly controlled and that the noise terms are traceless under Leray 
Projection; such assumptions are designed to circumvent the technical difficulties 
of a first order noise operator on a bounded domain, and is a luxury that the SALT 
equations do not have. 

The goal of this paper is to apply the abstract framework established in [32], 
providing a rigorous justification of the results first announced in [31] and extending 
them to the vorticity form of Eq. (1) on a bounded domain. The purpose of this is 
twofold: 


1. To demonstrate the efficacy of the criteria from [32]. This is most pertinent in our 
treatment of the velocity form on the torus; whilst the well-posedness results that 
we present are new, one would expect them to hold. This belief is compounded by 
the well-posedness results for both viscous and inviscid fluid equations perturbed 
by Lie transport noise on the torus [2, 13, 15, 16, 37] for which the Navier- 
Stokes equations do not present any intrinsic additional difficulty, though still 
contributing a necessary piece to the somewhat sparse literature. We hope to 
convey that verifying the assumptions of [32] is now the simplest and most 
efficient way to prove the well-posedness of this equation and, by extension, 
associated stochastic viscous fluid equations. 

2. To obtain the well-posedness of the equation in the presence of a boundary, which 
does present intrinsic additional difficulty beyond the existing results. 


In the interests of brevity we provide a shorter manuscript here, though greater 
detail can be found in the former arXiv version [33]. In Sect.2 we establish 
the stochastic and functional framework necessary to understand (1), along with 
fundamental properties of the operators involved. In Sect. 3 we make precise how 
equation (1) fits into the framework of [32], as a problem posed on the torus Ts. 
In Sect. 4 we consider the vorticity form of equation (1) as a problem posed on a 
bounded domain of RÌ. We again justify the assumptions in [32] to prove existence 
and uniqueness of maximal solutions to this equation. Additional details for the 
proofs are given in Sect. 5, along with the results of the partnering paper [32]. 
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2 Preliminaries 
2.1 Elementary Notation 


In the following O can represent both the 3-dimensional torus T? and a smooth 
bounded domain @ C R?. We consider Banach Spaces as measure spaces equipped 
with the Borel o-algebra, and use A to represent the Lebesgue Measure. Let (X, u) 
denote a general measure space, (Y, ||-||3)) and (Z, ||-|| z) be Banach Spaces, and 
U, C, )u). (H, C, JH) be general Hilbert spaces. O is equipped with Euclidean 
norm. 


e LP(X;)is the class of measurable p-integrable functions from ¥ into y, 1 < 
p « oo, which is a Banach space with norm 


lle = f oua. 


In particular L?(X ; V) is a Hilbert Space when ) itself is Hilbert, with the 
standard inner product 


(b age, = | 6). yoya. 


In the case X = O and y = R? note that 
3 
iolon = Ito b= (ohen) d:0R. 
"ESI 


We denote ||- ||z2(0;R3) by Il-llz and |-]H;2(0.:5; by Ill. 
e L??(X;y) is the class of measurable functions from AX into Y which are 
essentially bounded, which is a Banach Space when equipped with the norm 


loli cry) = inftC > 0: lóG)lly < C for p-a.e. x € X}. 
e L*(O; R?) is the class of measurable functions from © into R? such that 9! € 


L™(O; R) for l = 1,..., N, which is a Banach Space when equipped with the 
norm 


l 
lllz := supli || L=(0;R)- 
I<N 


e C(X; V) is the space of continuous functions from X into Y. 
e C(O; R) is the space of m € N times continuously differentiable functions from 
O to R, that is $ € C" (O; R) if and only if for every N dimensional multi index 
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a = a, ..., «y With |a| x m, D*$ € C(O; R) where D“ is the corresponding 
classical derivative operator 07, ... ƏZA. 

* C(O; R) is the intersection over all m € N of the spaces C" (O; R). 

* Co (0; R) form € Norm = ov is the subspace of C" (@; R) of functions which 
have compact support. 

e C" (O; R5), Co (6; R?) for m € N orm = of is the space of 
functions from O, @ to IR? whose N component mappings each belong to 
C" (O; IR), Cy (C; R). 

e W”?(O;R) for 1 < p < oo is the sub-class of L? (©, IR) which has all weak 
derivatives up to order m € N also of class LP (©, IR). This is a Banach space 
with norm 


lesse gt >: IDe 7: oan 


l| xm 


where D“ is the corresponding weak derivative operator. In the case p = 2 the 
space W"'?(O, R) is Hilbert with inner product 


($, V) wm.2(O:R) = 2a (D"$, D” V) rxo;g. 


lo] xm 


e w™©(O;R) for m € N is the sub-class of L??(O, IR) which has all weak 
derivatives up to order m € N also of class L??(O, R). This is a Banach space 
with norm 


ll ll ww.oe co.) :— sup ||D° $1 ,.0(0;R3): 


la | zm 
e W";P(O; R?) for 1 < p < oo is the sub-class of L? (O, R?) which has all weak 


derivatives up to order m € N also of class L? (O, IR?). This is a Banach space 
with norm 


3 
l 
len = X 10 motors 


l=1 


In the case p = 2 the space W"'?(O, R3) is Hilbertian with inner product 


3 
($, V) wma :— Lie VI) ym.2(0:R): 


e W’-%(O; R5) is the sub-class of L (O, IR?) which has all weak derivatives up 
to order m € N also of class L (O, R?). This is a Banach space with norm 
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. l 
IP ll ymo R) :— suplló I| wm.00(O;R)- 
I<N 


e L2 (T3; R3) is the subset of L2 (T3; R3) of functions $ such that 


Í pdr — 0. 
T? 


e Ww" 2 (T5; R2) is simply the intersection W"?(T?; R2) N L?(T?; R3). 

. Wo" (6; IR), Wo" (6; R3) form € N and 1 < p < œ is the closure of 
CLC; R), CX (6; R?) in W™ (6; R), W™P (0; R5). 

* Z(Y; Z) is the space of bounded linear operators from y to Z. This is a Banach 
Space when equipped with the norm 


lFllgo:z) = sup |Fyllz 
lyly=1 


and is simply the dual space * when Z = R, with operator norm ||- ||». 
e Z? (U; H) is the space of Hilbert-Schmidt operators from U/ to H, defined as the 
elements F € (U; H) such that for some basis (ej) of U, 


oo 
Yi Fell < 00. 
i-l 


This is a Hilbert space with inner product 


oo 


(F, G) men = X (Fei, Gei)g 


i=1 
which is independent of the choice of basis. 


We will consider a partial ordering on the 3—dimensional multi-indices by a < f 
if and only if for all / = 1,2, 3 we have that o; < fj. We extend this to notation < 
by a < B if and only if œ < £ and for some l = 1, 2, 3, a < fi. 

We also now introduce some less familiar spaces in slightly greater detail. We 
recall notation that ( represents a smooth bounded domain in R? which we now 
fix, T? is the 3—dimensional torus, and O freely denotes both T? and Ø. 


Definition 2.1 We define Cos (€; IR?) as the subset of CHG; R3) of func- 


tions which are divergence-free. y2 (O; R3) is defined as the completion of 
Coo, (€; R3) in L2(6; R?), whilst we introduce Wi?(o; IR?) as the intersection of 
Wy (6; R3) with L2 (6; IR?) and W2?(6; IR?) as the intersection of W??(6; R?) 
with W}? (0; R5). 


Recall that any function f € L?(T?; R?) admits the representation 
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fije > fee (3) 


kez? 


whereby each f, € C? is such that f, = f. and the infinite sum is defined as a 
limit in L?(T?; R?), see e.g. [44] Subsection 1.5 for details. 


Definition 2.2 We define L2 (T3; R3) as the subset of L? (T3; R3) of functions f 
whereby for all k € Z3, k- fy = 0 with fy as in (3). For general m € N we introduce 
w™-2(T?; IR?) as the intersection of W™? (T?; R?) respectively with L2 (T?; R3). 


Note that WŁ? (T3; R3) is precisely the subspace of W1? (T°; R?) consisting 
of zero-average divergence free functions. Similarly W?(6; IR?) is precisely 
the subspace of wE O ; R?) consisting of divergence free functions. Moreover, 
Wi?(6; IR?) is the completion of Co, (€; R?) in WH2(6; R?). The general space 
W1?(O; IR?) thus incorporates the divergence-free and zero-average/zero-trace 
condition. 

As for the stochastic set up, let (Q, F, (F;), P) be a fixed filtered probability 
space satisfying the usual conditions of completeness and right continuity. We 
take W to be a cylindrical Brownian Motion over some Hilbert Space U with 
orthonormal basis (ej). Recall ([30], Subsection 1.4) that W admits the repre- 
sentation W, = p» ej Wi as a limit in L^(Q; W) whereby the (W?) are a 
collection of i.i.d. standard real valued Brownian Motions and Sl’ is an enlargement 
of the Hilbert Space 4 such that the embedding J : í( — 4 is Hilbert- 
Schmidt and W is a J J*—cylindrical Brownian Motion over SU’. Given a process 
F : [(, T] x Q > .Z?($5 #) progressively measurable and such that F € 
L? (Q x [0, T]; . Z2? (&k; H)), for any 0 < t < T we define the stochastic integral 


t oo t 
[ Faw, = F,(ej)d Wi 
0 i= 10 


where the infinite sum is taken in L? (Q; .#). We can extend this notion to processes 
F which are such that F(w) € L? ([0, T]; Z (U; H)) for P — a.e. w via the 
traditional localisation procedure. In this case the stochastic integral is a local 
martingale in 7.7 


? A complete, direct construction of this integral, a treatment of its properties and the fundamentals 
of stochastic calculus in infinite dimensions can be found in [30] Section 1. 
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2.2 Functional Framework 


We now recap the classical functional framework for the study of the deterministic 
Navier-Stokes Equation. Firstly we briefly comment on the pressure term V p, which 
will not play any role in our analysis. p does not come with an evolution equation 
and is simply chosen to ensure the incompressibility (divergence-free) condition; 
moreover we will ignore this term via a suitable projection (in Sect. 3 we even 
consider a different form of the equation) and treat the projected equation, with 
the understanding to append a pressure to it later. This procedure is well discussed 
in [44] Sect. 5 and [40], and an explicit form for the pressure for the SALT Euler 
Equation is given in [45] Subsection 3.3. 

The mapping £ is defined for sufficiently regular functions f,g : O > IR? 
by Leg := X5 fi 9;g. Here and throughout the text we make no notational 
distinction between differential operators acting on a vector valued function or a 
scalar valued one; that is, we understand 0; g by its component mappings (ag)! = 
9j g!. We now give some clarification as to ‘sufficiently regular’, by stating basic 
properties of this mapping. For any m > 1, the mapping £ : W"* ?(O; R?) > 
W"2(O: R?) defined by f œ> £ ff is continuous. Additionally there exists a 
constant c such that for any f, g € W'^?(O; R?) for k € N as appropriate, we 
have the bounds: 


IE pell + Le fll S ellgllwiell fllw22: (4) 
ILefllwi2 < cllgllwizlfllwsa: G) 
ILe f llwi2 < ellgllw22ll fll w22- (6) 


We introduce the Leray Projector P as the orthogonal projection in L^(O; R?) 
onto T (O; IR?). It is well known (see e.g. [47] Remark 1.6.) that for any m € N, 
P is continuous as a mapping P : W"?(O;R?) — W"-2(O; R2). In fact, 
the complement space of 12 (©; R?) can be characterised (this is the so called 
Helmholtz-Weyl decomposition), a result that we state explicitly as we will need 
to exploit the precise structure in future arguments. 


Lemma 2.3 Define the space 
L270: R’) := (y € L(O; R):y = Vg for some g € W'?(O; R)}. 


Then indeed L21 (0; R3) is orthogonal to i (O; R2) in L?(O; RÌ), i.e. for any 
óc E (O; R5) and y € LL IR3) we have that 


($, V) — 0. 


Moreover, every f € L*(@; RÌ) has the unique decomposition 


f-éótv (7) 
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for some $ € LŽ (0; R3), we i2 R3) and every f € L?(T?; Rẹ?) has the 
unique decomposition 


f=otyrte (8) 


where $ € i (T?; R), y € Ler: IR?) and c is a constant function: that is, 
there exists k € IR? such that each component mapping c is identically equal to k, 
Jj =1,2,3. 


Proof See [47] Theorems 1.4, 1.5 and [44] Theorem 2.6 . o 
Corollary 2.3.1 Every f € L?(@; R?) admits the representation 


f-Pf*Vg (9) 


for some g € W!?(@;R). Similarly every f € L*(T?; R?) admits the representa- 
tion 


f=Pf+Vg+c (10) 


for some g € W!?(T?; R) and constant function c. 


Proof Itis an immediate property of the orthogonal projection that P f is the unique 
element $ € L2 (O; R?) of (7) and (8). n 


Through P we define the Stokes Operator A : W?2(O; R?) > ps (O; R?) by 
A :— —PA. Once more we understand the Laplacian as an operator on vector 
valued functions through the component mappings, (Af)! := Af!. From the 
continuity of P we have immediately that for m € {0} UN, A: W?"*22(O; R?) > 
W"2(O: R?) is continuous. We remark that A leaves the complement space 
Lo IR?) invariant, so AP is equal to A on W?2(O; R?). Moreover (see [44] 
Theorem 2.24) there exists a collection of functions (az), ak € Wi?(O; R3) N 
C% (O; R?) such that the (ag) are eigenfunctions of A, are an orthonormal basis 
in L2 (0; IR?) and an orthogonal basis in W1?(O; IR?). The eigenvalues (Àg) are 
strictly positive and approach infinity as k — oo. Therefore every f € i (O; R3) 
admits the representation 


f=} fea (11) 
k=1 


where fy = (f, ax), as a limit in L? (O; Rẹ). 


Definition 2.4 For m € N we introduce the spaces D(A"/?) as the subspaces of 
L2 (O; R?) consisting of functions f such that 
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oo 
2 
a k fk < œ 


k=1 


for fk as in (11). Then A”/? : D(A"/2) — [2 (O; R3) is defined by 


oo 
ani peo YAT f 
k=1 


We present some fundamental properties regarding these spaces, which are justified 
in [9] Proposition 4.12, [44] Exercises 2.12, 2.13 and the discussion in Subsection 
2.3; 


1. D(A"P?) c w"2(0: R?) n WH?(O; R?) and the bilinear form 


(faa im (APP f, ATI) 


is an inner product on D(A"/2); 
2. For m even the induced norm II-Il2, = (-,-)m is equivalent to the W™? (O; R?) 
norm, and for m odd there is a constant c such that 


Il-lwm2 < ell-llns 


3. D(A) = W2?(O; R3) and D(A!/*) = W}? (O; R?) with the additional property 
that ||-||; is equivalent to ||-|| i» on this space. 


It can be directly shown that for any p,q € N with p < q, p+q = 2m and 
f € D(A"/5), g € D(A4/) we have that 


(f, g)m = (APP? f, A1 g). (12) 


From here we can also see that the collection of functions (ag) form an orthogonal 
basis of W1?(O; IR?) equipped with the (-, -); inner product. In addition to using 
these spaces defined by powers of the Stokes Operator, we also use the basis to 
consider finite dimensional approximations of these spaces. 


Definition 2.5 We define P, as the orthogonal projection onto span(a|, ..., an} in 
L?(O; R?). That is 7, is given by 


n 


Pa: f > D, Ak) Ak 


k=1 


for f € L?(O; R?). 


The restriction of P, to D(A"/2) is self-adjoint for the (-, -),, inner product, and 
there exists a constant c independent of n such that for all f € D(A"/2), 
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Pn fll ym. < cll f llwn, (13) 


see [44] Lemma 4.1 for details. Similar ideas justify that for all f € W!:7(O; R°), 
g € W2?(O; R5), 


1 1 
IU Pad fl? < cif, I~ Pagik < = igl 


where J represents the identity operator in the relevant spaces. To conclude this 
subsection we discuss briefly bounds related to the nonlinear term, which will be 
used in our analysis. For every $ € WŁ? (O; R?) and f, g € W?(O; R?), we have 
that 


(Lo f. 8) = —( f, Loa) (14) 
(£o f, f) — 0. (15) 


2.3 The SALT Operator 


Having established the relevant function spaces and some fundamental properties of 
the operators involved in the deterministic Navier-Stokes Equation, we now address 
the operator B appearing in the Stratonovich integral of (1). As in [30] Subsection 
2.2, the operator B is defined by its action on the basis vectors (e;) of U. We shall 
show in Sect. 3.3 that B does indeed satisfy Assumption 2.2.2 of [30] for the spaces 
to V, H, U, X to be defined. With the notation of [30], each Bj is defined relative to 
the correlations &; for sufficiently regular f by the mapping 


Bi: f e Laf t Tif 
where £ is as before, and 7 is a new operator that we introduce defined by 


3 


Tf = 9, fI vgl. 


j=l 


We shall assume throughout that each £; belongs to the space W1?(O; IR?). If for 
some fixed m € N we have é; € w'+2.00 (2; RÌ) then for all k = 0,..., m + 1, 


ITa f lia < elei dpi oe If Mi (16) 


Le fll oa < clé pi MF Moya (17) 


NBs Flu < ehil f oaa (18) 
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Moreover Tg, is a bounded linear operator on L? (O; R?) so has adjoint Tg 
satisfying the same boundedness. In conjunction with property (14), £e; is a densely 
defined operator in L?(O; R?) with domain of definition W'?(O; IR?), and has 
adjoint Le in this space given by —Lz, with same dense domain of definition. 
Likewise then B* is the densely defined adjoint —Lg, + Te Our techniques centre 
around energy estimates, where the key idea as to how we preserve these estimates 
in the case of a transport type noise owes to the following proposition. 


Proposition 2.6 There exists a constant c such that for each i and for all f € 
W**22(0; R5) with k = 0,..., m, we have the bounds 


(B? f, f)wka + Bi f Moya < elé yezo f logia: (19) 
(Bi f. FY gna < léi liye oll flyer (20) 
Proof See Sect.5.1. o 


Another valuable result is given now, which will be necessary in showing 
comparable estimates to Proposition 2.6 in the (-, -)x inner product for appropriate 
k. 


Lemma 2.7 We have that 
B; : LE: (0; R?) n W'?(O; R°) > L3+(O; R°) 


and moreover that P B; = P B;P on W?(O; RÌ). 
Proof See Sect. 5.1. o 


We note that this result holds true only in the presence of the additional 7z, term 
in the operator, highlighting the significance of considering a noise which is not 
purely transport. The Leray Projector does pose difficulties in the presence of a 
boundary though, which we state here. 


Remark 2 The Leray Projector does not preserve the space Wa 20 ; R3), and so 
we cannot say that PB; : W2?(6; R3) > W1?(6; R3). The issues arising from 
this operator not satisfying the zero-trace property are fundamentally why we only 
treat the Torus for the velocity form in Sect. 3. 


3 The Velocity Equation on the Torus 


In this section we restrict ourselves to the Torus T?, leaving a treatment of the 
bounded domain to Sect. 4. We also now fix our assumptions on the (&;), assuming 
that each &£; € WI?(T?; R?) N W?-(T?; R?) and they collectively satisfy 
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oo 
NETTES Q1) 
i=l 


3.1 Definitions and Results 


Here we state the key definitions and results of this section. To facilitate our analysis 
we work with an equation projected by the Leray Projector as discussed at the start 
of Sect. 2.2. Thus we consider the new equation 


t t t 
uy — uo +f PLu, us ds + vf Au, ds +f PBus o dW, = 0 (22) 
0 0 0 


obtained at a heuristic level by projecting all terms of (1). Having not defined 
solutions of (1) we cannot be too formal here, but the idea is that we require solutions 
in L (O; R?) with initial condition also in this space so they are invariant under P, 
and P is a bounded linear operator so can be taken through the integrals (see [30] 
Corollary 1.6.12.1 for this result in Itó integration, understanding the Stratonovich 
integral as the sum of an Itó and a Bochner integral). 


Theorem 3.1 For any given J9—measurable uo : Q > W2?(T*; R2) there exists 
a pair (u, T) such that: t is a IP — a.s. positive stopping time and u is a process 
whereby for P — a.e. œ, u.(@) € C ([0, T); W2?(T*; R3)) and u.(w)l.<r(w) € 
L? ([0, T]; W2?(T*; R5) for all T > 0 with u.1.<; progressively measurable in 
W2-?(T?; R?), and moreover satisfying the identity 


[AT [AT [AT 
Ut = ug— f PLYy us ds — vf Aus ds — 1 P Bus o dW, 
0 0 0 


IP — a.s. in L2 (T?; RÌ) for all t > 0. 
Theorem 3.1 will be proved as a consequence of the following proposition. 


Proposition 3.2 Suppose that (u, t) are such that: t is a P — a.s. positive stopping 
time and u is a process whereby for P — a.e. œw, u.(w) € C (I0, T]; W2?(T*; R3)) 
and uw)l.<rw) € L?([0, T]; W27(T?; R?)) for all T > O with u.l.<r 
progressively measurable in WP IR?), and moreover satisfying the identity 


[AT [AT 
Ut =w- f PLy us as—v | Aus ds 
0 0 

oo 


tAT tAT 
ER J X PB?usds -f P Bu,dW, 
0 i=l V 
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P — a.s. in WŁ? (O; R3) for all t > 0. Then the pair (u, x) satisfies the identity 


tAt tAt tAt 
uj = uo— f PLYy,Us ds — vf Aus ds — Í P Bu; o dW, 
0 0 0 


P — a.s. in Le: R3) for all t > 0. 


Proposition 3.2 motivates studying the converted equation 


t t 1 t oo t 
ur = w- | PLu, ts ds — vf Au; ds + ;] y PB?usds -f PBusdW,. 
0 0 emer 0 


(23) 


Once we convert to the Itó Form though, starting with an initial condition in 
w2?(T*; IR?) is not optimal in the sense that, at least according to the deterministic 
theory, we should be able to construct a solution (satisfying the identity in 
L (T?; R?) as is natural) for only a wi? (T°; IR?) initial condition. To this end 
we give the following definitions and the main result of this section. Definition 3.3 
is stated for an arbitrary Jo —measurable ug : Q > w1?(T*; R3). 


Definition 3.3 A pair (u, t) where t is a P — a.s. positive stopping time and 
u is a process such that for IP — a.e. w, u.(w) € C (LO, T]; Ww (T3; R3)) and 
u.(w)l.<r(œ@) € L? ([0, T]; Ww2?(T*; R3)) for all T > O with u.1.<; progressively 
measurable in W2?(T*; IR?), is said to be a local strong solution of the equation 
(23) if the identity 


[AT [AT 
Ut =w- | PLu, us as—v | Aus ds 
0 0 

1 oo 


[AT [^T 
ac al X PB?usds -f P Bu,dW, (24) 
9  j2 8 


holds P — a.s. in L2 (T?; R°) for all t > 0. 


Remark 3 If (u, x) is a local strong solution of the equation (23), then u. = u.,z. 
The time integrals in (24) are well defined as Bochner integrals in L2 (T?; R3), and 
the Itô integral in wr ; R3). These integrals are shown to be well defined in the 
abstract case in [30] Definition 2.2.3. 


Definition 3.4 A pair (u, ©) such that there exists a sequence of stopping times (0;) 
which are P — a.s. monotone increasing and convergent to ©, whereby (u.,;, 0;) 
is a local strong solution of the equation (23) for each j, is said to be a maximal 
strong solution of the equation (23) if for any other pair (v, r) with this property 
then © <T P — a.s. implies © =T P — a.s.. 


Definition 3.5 A maximal strong solution (u, ©) of the equation (23) is said to be 
pathwise unique if for any other such solution (v, I), then © = F P — a.s. and for 
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all t € [0, ©), 
P(w € 2: ulo) = v;(@)}) = I. 


Theorem 3.6 For any given Fo— measurable uo : Q > wa: R3), there exists 
a pathwise unique maximal strong solution (u, ©) of the equation (23). Moreover 
at P — a.e. w for which O (w) < œ, we have that 


O(@ 


) 
sup lus CI + f lu, (w) 3dr = oo. (25) 
) 


re[0,O(w 


3.2 Operator Bounds 


In this subsection we state some intermediary results regarding control on the 
operators involved. In the following and throughout this section c will represent 
a generic constant changing from line to line, c(e) will be a generic constant 
dependent on a fixed e, f and g will be arbitrary elements of W22(T*; IR?) and 
Sn € spaníai, +- , an}. The proofs of these lemmas can be found in Sect. 5.1. 


Lemma 3.7 For any e > 0, we have that 


|(PnPL s, fn» fado) € cL falls + ell fall: (26) 

(Pn PB? fa, fn)i + (PnP Bi fa, PnP Bi fa) < CONE Gya.coll full} 
+ ellé l?s. ll fallo: (27) 

(PnP B? fa, fÍn)2 +F (PnP Bi fn, PaP Bi fn)2 < c(e Ell ys sel fo 
+ ellé; Moyses fall3- Q8) 


Remark 4 The algebra property of W*:?(T?; R3) when k = 2 is fundamental in 
proving (26); a result of the form 


|(Pu PL fy fas fai] < cH fall + ell fall 


is unavailable to us given that the algebra property does not hold for k = 1. This is 
revisited in Remark 5. 


Lemma 3.8 For any e > 0, we have that 


(PLy f — P£sg. f- i| < cl) (Ilt + IF) If — el + ell f — ali. 
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Lemma 3.9 For any e > 0, we have that 


IP£gf —PLeg, f —g)| < eI FIRS — ell? + elf — gl? 


3.3 Proof of Proposition 3.2 


We prove this result through the abstract procedure used in [30] Subsections 2.2 and 
2.3, the result of which is stated in Sect. 5.2. Towards this goal we define the quartet 
of spaces 


V = WÈ’ (T?; RÌ), ewe), 


U := WE? (TÌ; R°), X := LE (T?; R°). 


We equip E (T?; R?) with the usual (-,-) inner product, but then equip 
w1?(T*; R3) and we (T?; R2) with the (,-) and (-,-)2 inner products 
respectively. In fact we also have that D(A??) = wè? (T?; IR?) and that the 
(-,-)3 inner product is equivalent to the usual (-, -) y3,2 one on this space (see [44] 
Theorem 2.27), so we endow W2?(T?; R3) with (-, -)3. Our SPDE (22) takes the 
form of (68) for the operators 


Q := — (PL + vA), G := —PB 


where we rewrite (—P B;)? as PB? firstly from the linearity of P B; to deal with the 
minus and secondly using the property that P B; = P B;P. It is worth appreciating 
here that we chose to project the equation and then convert it into Itó Form, 
but we may equally have chosen to convert the unprojected Stratonovich Form 
and then project the resulting Itó Equation. Without addressing the conversion 
of the unprojected equation in complete detail, we would directly arrive at (23) 
taking this approach. Indeed before projection our correction term would be of 
the form » 5, B? plus a function in LZ-«(T?, R?) as defined in Lemma 2.7. 
Under projection this is P 3 ?*, B? which is just $72} PB? from the linearity and 
continuity. Therefore the property PB; = PB;P from Lemma 2.7 establishes the 
consistency between these approaches. 

To prove the result we check the Assumptions 5.3 and 5.4. Starting with 5.3, 
we first of all have that vA is continuous from wè? (T3; R?) into wi (T°; R3), 
therefore it is measurable and as a linear operator too satisfies the boundedness. As 
for PL, measurability is satisfied in the same way and for the boundedness we have 
that 


IPZ£rfli S cl/PEgfliwi2 S elEs flw < ell fllweall fllws2 < ell fllill flls 
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for any f € wè 2 (T3; R?) where c is a generic constant, critically applying (5). This 
verifies Assumption 5.3 so we move on to Assumption 5.4, which is immediate from 
(18) and the linearity of PB to show continuity in all relevant spaces. 


3.4 Proofs of Theorems 3.1 and 3.6. 


We use the abstract results of [32] stated in Sects.5.3 and 5.4. Defini- 
tions 3.3, 3.4, 3.5 and Theorem 3.6 are precisely Definitions 5.19, 5.20, 5.21 and 
Theorem 5.22 for the equation (23) with respect to the spaces V, H, U, X as defined 
in Sect. 3.3. Indeed we would also prove Theorem 3.1 through Proposition 3.2 by 
showing the existence of a local solution with the regularity of Definition 5.12. 
Therefore we prove both Theorem 3.1 and 3.6 by showing that the assumptions of 
Sects. 5.3 and 5.4, are satisfied. We work with the operators 


P Vou 
A= —(PL+vA)+ z 9 PB, G:=—PB 


i=l 
which were addressed to be measurable mappings into the required spaces in 
Sect. 3.3. We now prove Theorem 3.1 by justifying the assumptions of Sect. 5.3. 


Proof of Theorem 3.1 First note that the density of the spaces is immediately 
inherited from the density of the usual Sobolev Spaces and the equivalence of the 
norms. The bilinear form satisfying (70) is chosen to be 


(f. g)uxv = (A'P f, ATP g) 
which reduces to the (-, -)2 inner product from (12). The remainder of the proof is 
in treating the numbered assumptions. 


Assumption 5.6: We use the system (ag) of eigenfunctions of the Stokes Operator. 

Assumption 5.7: Once more (74) follows from the discussion in Sect. 3.3. For (75) 
we treat the different operators in A individually, starting from the nonlinear 
term: 


IP rf —P£sgli— IPLE — 8) + P£r-sgli 
< IPLE — eli PE r-sglh 
< cle pCf = Ollwi2 + cll£p—ggll wie 
< cl fllwi2Wf — gllws2 + cll f — £lwiallgllws2 
< c (Ilf lli + Ilgllws2) f£ — &llwsa 
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s cQ fli Nella) lf — ells 


having applied (5). From the linearity of vA and 1 bao PB? then the corre- 
sponding result follows immediately from (74) and this subsequently justifies 
(75). Additionally (76) follows immediately from the already justified Assump- 
tion 5.4. 

Assumption 5.8: (26) and (28) will be our basis of showing (77). The task is to 
control 


1 = 2 = 2 
a(r; (n E T) Fas n) + 2 IP PB: Sul 
i= 2 I 


which we rewrite as 
— (Pa PE fa fn. fn)2 — 2V (Ph Afn, fu)2 


+ Y (PaP B? fa, fio IPP Bi full) - (29) 


i=1 
Recalling the assumption (21) and (26), (28), we have that for any £ > 0, 


Q9) < —2v(PnAfn, fu)2 + CON falla + elfali 


Y (eMe. + eli ys fal) 


i=1 


= —2v(Afn, fn)2 + [eons + c(e) NI 


isl 
oo 
+e | + ATUM ATE 
i=l 
< -20 (APA fn, A? faya + ec) [1 + M8 | 


+e | + ATUM IAE 


i=l 


= 2v fali ec [1 + I fall] A+ e [ J MI E 


i=l 


where we have embedded the X illé; ee into the constant c(e). Therefore 
by choosing 
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v 


LE MEI?) 3,00 


then (77) is satisfied for x := v. Moving on to (78), we are interested in the term 


2 (Pa P Bi daduis 


— 


Observe that 
(PaP Bi fs, fa) = (P Bi fa, fa)? 
= (PIA, Bil fa + PB; ^ fa, Af Y, 
< 2(P[A, Bil fa, Afn)? -2(PBi fn, AfaY 
= 2([A, Bil fn, Afay + UBA fa, Afay 


The first of these terms is dealt with through a simple Cauchy-Schwarz, as 
(^, Bil fa, Afa? < MA, BAA ARM < clé lys l falls 
using Proposition 5.2, and the second comes directly from (20) as 
(BiA fn, Afn)” < cl&illyi coll Afall” < cl&ill ya.coll fal. 


Summing up the two terms and over all 7 gives that 


DA (PnP Bi fas fa) < «(x BUE ie I full3 


i=l 


which justifies (78) and Assumption 5.8. 
Assumption 5.9: For (79) we must bound the term 


iE jc 
2{(-Pe-va+ 5 5-78") f ( PL vat spa) ssa) 


i=l i=l 


oo 
+ Y IPB: f -PBigl 
i=l 


which we simply rewrite as 


—2(PLy f — PL8, f — 8g) — 2v(A(f — 8), f — 8A 
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+ Y (PEG - 8), f- ei IPB — oW) 


ee 


= 


l 


and inspect the distinct items individually. Firstly from Lemma 3.8 we have that 
for any € > 0, 


-2(P£; f —PLeg, f — gh < cC) (lied + FI) If — el elf — ell. 
(30) 


Similarly to the justification of Assumption 5.8 we also see that 


— 2v(A(f — 8), f — g)1 vll f — ella. (31) 


Shifting focus to the final term, note that in (27) we in fact showed that 


(PB? fas fa) + (PBifa,P Bi faja < clé saec ll falli + elé: yao ll ful 


and scanning the proof we see that all arguments hold for arbitrary f, € 
W2?(T*; R3) so we can deduce directly the bound 


(PB — 8), f —8)1 + IPEE — 9I S cO olf- gilt 


+ ellé: lys llf — ella 32) 


Summing over (30), (31) and all i in (32), we deduce a bound by 


— 2vll f — ell + cle) L +III + AM If- ell 


i=l 


+e i + TM If — &lb 


i=1 


so again a choice of 


v 
E = (33) 
1+ 2% NEI? 5,20 
ensures (79) is satisfied for x :— v. Moving on to (80), we are interested in the 
term 
oo 
y PBG-29. f — 8) (34) 
i=1 


noting that 
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(PB — 8), f - yi = (APBi(f — 8), f — 8) 
3 
P» 9,BiCf — 8), 9k Cf — Y. 


We have 


Ok Bi(f — 8) = Bag(f — 8) + Bidk(f — 8) 


So 


(Ox Bi(f — 8), (Cf — 9), < 2(Bae, (f — 8), KF — 9) 
T2(Bi!kCf — 8), f — 9). 


Now from (18), 


(Bae; (f — 9.9 — 9) < Bae: Cf — 8I le(F — II 
< ellé: lys. f — ell 


and from (15), 


(Bide f — 8), lF — Y = (Te def — 8), lF — 9)Y 
< cll&iljpacoll f — gilt. 


By summing both terms, over all k = 1,..., N and i € N, we have shown that 


2 PAGS 8) f- os (ei 1- el (35) 


i=l 


demonstrating (80) and hence Assumption 5.9. 
Assumption 5.10: For (81) we must bound the term 


1 c 
(one LL T) f. ] + IPB: fll 


i=l i-l 
which we simply rewrite as 


oo 


-XP£ f, fi - WAS, fi Y (PBE Pn + PBST). (36) 
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The nonlinear term can be controlled precisely as seen in Lemma 3.8 to deduce 
that for any & > 0, 


PLs f. Pul s NFI + ell FIG. 
Meanwhile across (31) and (32) we have that 


= 2v(Af, fji + Y (PB? f, fi IPB: FIR) 


i=l 


3 
< —2v| f Il + c(e) [Soi Isi 


i=l 


+e PM ME 


i-l 
so with the familiar choice of & (33) we see that 
(36) < c (1 17) - rid 
which is more than sufficient to show (81). (82) follows immediately from (35), 


concluding the justification. 
Assumption 5.11: For any 7 € w2(T?; IR?) we must bound the term 


Eo og lec s 
(roe neon) (-re vA+ >) PB; g.n 
1 


i=l i=l 


which we simply rewrite as 

oo 
= (PLs f — PLgg, mi — (AC — 8), ni + > (PBA — 8). mh 

i=l 

and further by 
oo 

—2(P£ s f — PLgg, An) — 2v(A(f — 8), An) + 9 (PB? (f — 8), An). 

i=l 

Through Cauchy-Schwarz this is controlled by 


llla (unc; - PLegll + 2vIAG — gl + DIP BPE - 2 


i=1 
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so our problem is reduced to bounding the bracketed terms. The linear terms are 
trivial when recalling (18), and for the nonlinear term we revert back to (4) to see 
that 


IPC rf —P£sgll S Ef- fll + IE — 8l S cfl + lel If — all 


comfortably justifying the assumption. 
oO 


Proof of Theorem 3.1 The new space X will again be L2 (T?; R3) as laid out in 
Sect. 3.3. We choose the bilinear form (-, -) x « g to be given by 


(f; 8)xxH :— (f, Ag) (37) 


noting that the property (86) follows from (12). Noting also that the system (ax) 
is an orthogonal basis of wt? (T°; IR?), and that the operators were shown to be 
measurable into the relevant spaces in Sect. 3.3, we are in the setting of Sect. 5.4. 
We now proceed to justify the assumptions required to apply Theorem 5.22. 


Assumption 5.16: This follows identically to Assumption 5.7, referring again to 
Sect. 3.3 and (4). 
Assumption 5.17: Continuing to consider the distinct terms, we have that 


-2v(A(f — 8). f — 8) = Mf — gli 


and 


(PB?(f — 8), f — 8) + IPB: - 9)? 
< (Bf — 8), f — 8) + IBF — 9I? < ellé: l?a f — gl. 


from (19). With these components in place, the proof of (89) then follows 
identically to that of (79). (90) is a direct consequence of (20), concluding the 
justification. 
Assumption 5.18: This stronger Assumption was in fact already verified in the 
address of Assumption 5.10. 
oO 


4 The Vorticity Equation on a Bounded Main 


In order to address the well-posedness problem of the SALT Navier-Stokes Equa- 
tions on bounded domains, we now pose it in vorticity form. The analysis conducted 
in Sect.3.4 was done with reference to the properties derived across Sects. 2.2 
and 2.3, applicable to the bounded domain as well as the torus. The issue in studying 
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the velocity form is that our operators do not map into the correct spaces in order 
to use these properties: in particular, the Leray Projector does not preserve the zero 
trace property and so the operators do not map into the necessary WEO ; R3) 
spaces (see Remark 2). The motivation behind the vorticity form is to circumvent 
the necessity of Leray Projection. 

Our attentions shall be decidedly on the bounded domain @, though the results 
for the vorticity form carry over seamlessly to the torus T?. For this section we 
impose new constraints on the £j, which are such that for each i € N, & € 
Wi?(6; R3) N wẹ? (O; R?) N W? (0; R?) and they collectively satisfy 


oo 
X léo < 00. (38) 
i=] 


4.1 Deriving the Equation 


The vorticity form of the equation is derived through taking the curl of the velocity 
form, where the curl operator is defined for f € W!?(@; R?) by 


d2 f? — 94 f? 
cuf := | a3 f! — a, f? 
9f? — 9, f! 


We introduce the Lie Bracket operator .Z/ defined on sufficiently regular functions 
f.g: 06 — R by 


Lpg := Lg — Lef. (39) 
In [44] Subsection 12.1 it is shown that, with notation $ := curl f, 

curi (Cy f) = £j 
where it is also observed that the curl of elements of W+? (6; R?) N Lo ; R3) 


is null. It is clear that the Laplacian commutes with the curl operation, and we now 
consider how the curl operation interacts with the stochastic term. 


Lemma 4.1 We have that 
curl(B; f) = Zao (40) 


where once more $ := curl f. 


Proof We shall show only that the identity (40) holds in its first component, with 
the others following similarly. To this end we calculate the first component of the 
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left hand side of (40): 


[curl(B; f)]' = à»[B; f — LB; f P 


3 3 
o [Y 80; P + Pa | 2| 3283; f? + fling! 


j=l j=l 


3 
= Y (wea; P? eoo; P? + afiat] + fiaa) 


j=l 


(asia + &/ 30; f? + às f^ 06) + fiai) 


3 


J 
3 
= Y (e/o; ? + 6/000; + af ati — 036/ 9; 7? 


j=l 


—&/ 939; f? -af aaki) 


3 


“Ls 18; (82 f? — a3 f? «Y (on) 0; f? 02/056] 


j=l j=1 


Cogia- apial) 


= [£a] 


i 


«Y (as 8; f? + aaf 056] — ajaj f? — as flop]. 


j=l 


Therefore it remains to show that 


3 
^ (aslap + dof asti calo — a3 f/08)) = Lyi’. aD 


j= 


a 


We expand the sum in (41) to 
(02101 f? + a2'996} — 036! f? — ds fn) 
+ (828P f? + 92,f79a6? — aid f? — a3 fioi) 
+ (028703 f? + 02 F036) — 9620 f? — 05 098?) 


achieving some immediate cancellation in the second two brackets to 
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(24131 f? + df! ase} — S6] f? — os foi) 
+ (806702? — 05 F028?) + (8.056? — 056703 f?) . 
We now simply rewrite the above by combining like terms, into 
825; (91 f? — 0f!) + 838} (02 f! — 81/7) + (0087 + 0367) 2 P? — 037) 
or more succinctly as 
— di] d^ — 33E d + (rk; + DEPO 


to which we add and subtract 0,19! to arrive at 
3 3 
j=l j=l 


The first term is precisely -[£s&]! as we wished to show, appreciating that the 
second term is zero given the divergence free condition on £j. o 


From this point forwards we adopt the notation of .Z; := .Zz. Writing the 
Stratonovich integral of (1) in its component form over the basis vectors of 4, and 
introducing the notation w :— curlu, at a heuristic level we can take the curl of (1) 
to obtain 


t t oo t 
Ww, — w+ f Lu, Ws ds — vf Audi Y. f .Ziw odW; — 0. (42) 
0 0 21 40 


Having already rigorously demonstrated the It6 conversion of the velocity form 
(22) we shall make the conversion without explicit reference to the conditions of 
Sect. 5.2, though this can be precisely shown in the same way. At least again then at 
the heuristic level, the Itó form is 


t t ] (S Om. opt . 
wt =w- | Lawsdstv f Aw dee s | 3 Huas— >> f -ZiwdW; 
0 4 9 j=1 i=1 "0 


which can again be projected? to the equation 


3 Although the pressure term is eliminated, we still consider the Leray Projection to ensure that all 
terms belong to Li (6; R5). 
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t t 
wr, = wo — f PLu, ws ds — vf Aw; ds 
0 0 


1 oft oo 9o t : 
«zl Y PZPuds— >> | P.ZiwdW;. (43) 
0 j=1 imi“? 


Having motivated this section by an avoidance of the Leray Projection this may 
seem counter intuitive, however we shall shortly show that the projection is not felt 
in the noise (where it becomes problematic in velocity form); that is to say that for 
sufficiently regular w, Pw = w. The goal is to deduce the existence of a 
unique maximal solution of (43), a task which requires some clarification. Having 
reached (43) from the velocity form, we now look to solve the equation for vorticity 
which demands a representation of the velocity u in terms of the vorticity w. For 
this we quote Theorem 1 of [20] (or in fact, a slightly relaxed version). 


Theorem 4.2. There exists a mapping K : € x © — R? such that for every $ € 
W1?(6;R5)nW*'P(6;R?)wherek e NU(0], 1 < p < oo, the function BS : 
O — R? defined X — a.e. by 


(BSk)(x) = | Ke. oot (44) 


is such that: 


1. B8Skó € L2 (6; R n Wt! (6; R5); 
2. curl(BSx¢) = ¢; 
3. There exists a constant C independent of $ (but dependent on k, p) such that 


IBSk ll wee < Clléllwer- 


It should immediately be noted that such a K is not claimed to be unique, and 
in [20] is explicitly shown to be non-unique, however it does allow us to identify 
a velocity from a given vorticity satisfying the divergence-free and boundary 
conditions (2). From this point forwards we fix a specific K from the class of 
admissable integral kernels postulated in Theorem 4.2. We thus understand the 
nonlinear term as a mapping 


$ — LBS 9- 


This mapping shall at times be simply denoted by “gs, . The Eq. (43) is thus closed 
in w. 
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4.2 Definitions and Results 


We now state and prove the existence, uniqueness and maximality results for (43). 
We recall that to solve the velocity form (23) we used the extended criterion of 
Theorem 5.22, requiring the space V := wè? (T? ; IR?) to prove Theorem 3.6. 
This arose naturally in first showing Theorem 3.1, where we considered solutions 
explicitly in terms of the original Stratonovich form. For (43), however, solutions 
can be obtained for the natural choice of wo € W1?(6 ; R?) with an application 
only of Theorem 5.15 in Sect. 5.3 (see Remark 5). We note that such a choice is 
natural as the identity is satisfied in L?(6; R3). Thus we do not take the detour of 
considering a fourth Hilbert Space to rigorously define solutions of the Stratonovich 
form (42), although this can be done similarly. Notions of local strong solution, 
maximal strong solution and pathwise uniqueness can then be defined identically to 
Definitions 3.3, 3.4 and 3.5 for the new identity (43). The result is then the following. 


Theorem 4.3 For any given Fo— measurable wo : Q —^ W1?(6 : R3), there exists 
a unique maximal strong solution (w, ©) of the equation (43). Moreover at P — a.e. 
w for which ©(@) < oo, we have that 


Q(o) 


sup lw (o)l? + f! EE AE 
re[0,8(0)) 0 


4.3 Operator Bounds 


Just as in Sect.3.2, we state some intermediary results regarding the operators 
involved. In the following and throughout this section c will represent a generic 
constant changing from line to line, c(s) will be a generic constant dependent 
on a fixed e, $ and v will be arbitrary elements of W2?(o; R?) and $, € 
span{a1, --- , an}. The mapping .Z; satisfies the same boundedness as (18), and also 
the following. 


Lemma 4.4 There exists a constant c such that for each i and for all f € 
w*+2.2(0; R5) with k = 0, 1, we have the bounds 


CZ? f, Fyw M Spee S clé opens lf Mia (45) 
Uf. fissa < CMG Fisica lF Moya: (46) 
ILA, 1f IP < chi lill f Moa (47) 


where [^, £] is the commutator 


[A..Zi] :2 A.Zi — ZA. 
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Proof See Sect.5.1. o 
Lemma 4.5 For any ¢ > 0 we have the bound 
(P. PL n , $n) 1+ (PaP Z; $n , PiP Ži Pn) 1 


2 2 2 2 
< ele) léi liy. llón WT + Elé: ys se ll nll. 


Proof This now follows precisely as for (27). o 


Lemma 4.6 For any ¢ > 0 we have the bound 


(PP. 5s, dn. Ón)1| < Ellen li + elon lla. 


Proof See Sect. 5.1. o 


Remark 5 Recalling Remark 4, the nonlinear term in velocity form fails this 
estimate. It is satisfied in the vorticity form due to the additional regularity that 
Jn has compared to $,. This difference is what enables us to apply Theorem 5.15 
directly in the case H :— wi for the vorticity form, whereas for the velocity form 
the appropriate estimate is only satisfied for H := w2? (see Assumption 5.8) hence 
the need for Theorem 5.22 in velocity form. 


In the following g is defined by 


roe Í, kesio 


as in Theorem 4.2. The subsequent lemma is in analogy with Lemma 3.9. 


Lemma 4.7 For any & > 0, we have that 


|(PLa5q6b — Posee. — V)| s c) [If + lt] Ié— IP elló— ls 


Proof See Sect. 5.1. o 


4.4 Proof of Theorem 4.3 


As discussed we apply Theorem 5.15, which we do for the spaces 
V i= W2? (0; R3), H= wt? (0; R°), Us L? (6; R5). 
Proof of Theorem 4.3 The density relations are clear as Cos, (C; R) c 


W2?(6; IR?) is dense in both W1?(6; IR?) and L2 (6; R2). The bilinear form 
(70) is simply again (37). Now we shift attentions to checking that the operators 
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are measurable into the correct spaces. We note that “gs, has improved regularity 
properties over £ given item 3 of Theorem 4.2, so retains the continuity with 
measurability following. There is no change to the Stokes Operator from Sect. 3. As 
for P£, we in fact first show that Y € C (w2?(o; R3); W1?(6; R)) (and hence 
is invariant under 7^). This consists of three parts: showing that it is continuous as 
a mapping into W'?(6; R?), showing the divergence free property and then the 
Zero trace property. In fact with the appropriate regularity, it follows identically to 
(18) that we again have 


lol y < clé liye lelie. (48) 


which addresses the continuity. The fact that .Z;$ is divergence free comes 
immediately from the relation .Z; = curl (B;[BSx@]) and the well established fact 
the divergence of a curl is zero. For the zero trace property it is sufficient to show the 
existence of a sequence of compactly supported y, € W?(6; R?) which converge 
to Yo in W?(6; R?). By definition of the property that £j € WÈ? (OR?) there 
is a sequence (Yn), Yn € Cg (€; IR?) such that y, — & in W>? (OR?). Evidently 
Nn := Ly, is again compactly supported, and observe that 


IZy.% — Lllwi2 = IZ, blw < cllyn — &illwz2lóllwz2 


from (6), which converges to zero as required to justify the zero trace property. The 
fact that PL? e C (W2? (0; R3); L2 (6; R3)) again follows from the linearity, 
(48) and continuity of P. We now proceed to justify the numbered assumptions of 
Sect. 5.3. 


Assumption 5.6: We use the system (ag) of eigenfunctions of the Stokes Operator. 

Assumption 5.7: Items (74), (75) follow identically to the justification of (87), 
(88) for (23) given the increased regularity of & over £ and the corresponding 
boundedness of the noise term (48). With the linearity of .Z; then (76) follows 
trivially from (48). 

Assumption 5.8: (77) now follows from Lemmas 4.5 and 4.6 in the same manner 
as for the velocity equation. (78) is shown exactly as (82) was for the velocity 
equation, using that the P, are orthogonal projections in W1?(6G ; IS). 

Assumption 5.9: The justification now comes together exactly as in the proof for 
Assumption 5.17 in the velocity case, noting again that (19) holds for .Z; as well, 
and using Lemma 4.7. 

Assumption 5.10: There is very little to demonstrate here, as the linear terms follow 
from Assumption 5.9 so we just briefly address the nonlinear term. Through the 
same process as in Lemma 4.7, we have that 


^ As PY is equal to.Z; on W22 (6; R?), then (P. is equal to PL? on this space too, justifying 
the consistency between taking the Leray Projector and then converting from Stratonovich to Itó 
and vice versa as seen for (23). 
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|(PLyrb. p| x G0. 0) + (Lo f. à) 
< c [Ilf llaliól: + loll fit lll 
< colliei 


where the rest simply follows as in Assumption 5.9. 


Assumption 5.11: We consider the different operators in turn, starting with the 
nonlinear term and using that 


PZ — PLew, n)| = Gro — £o f — Co + Lyg.n)| 
=|(Lp-gh+Lo(o — W)—Lo-v f — Lylf — 8). n)| 


where exactly as in Lemma 4.7 we have that 
IL p-gb + Lelo — V) — Low f — Ly Cf — oll S clliél + Ivhillé — vili 
so in particular 
(P5 — Psy. m| < Ill Ce Ulélh + lvl le — vl - (49) 
For the Stokes Operator we simply apply Proposition (12) to see that 
(Ad — Ay, n)| = (A@— Y), ml = ($ — Y, ni < lo = Ylli — GO) 


and for the ge term we use the adjoint characterisation to observe that 


LEE — .Z2y, m = ICZ2(6 — v), m= 1C (6 — v). Bn 
< ellé lalo — Ylin. (51) 


Combining (49), (50) and (51) gives the result. 


5 Appendices 


5.1 Proofs from Sects. 2.3, 3.2, and 4.3 


We begin with the proofs from Sect. 2.3. 
Proof of Proposition 2.6 We begin with showing (19), tasked to control 


(B? f, f)wka + IIB: f Ma (52) 
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and do so with each derivative in the sum for the inner product: that is, we are 
looking at 


(D* B? f, D? f) + (D? Bi f, D" B; f) (53) 


where we simplify the derivatives by writing 


D” Bg f = > Bide DP f 


a! xa 


=) Bp " D” f + B D? f. (54) 


a’<a 
Plugging this result in, we also see that 
D" B£, f = D" Bg, (Ba f) 


= > B psi, D" Be, f + Bg D" Bg f 


a' «a 
which will use in our analysis of (53), reducing the expression to 
(F Bp DY By f + By D* By f, D* f) + (D* By f, D By f) 
a! «a 
which we further break up in terms of the adjoint Be: 
( Y. hee" Bs, f. D® f)+ (D" Bs f, B% D° f) -(D* Bg, f, D* Bg f) (55) 
a’<a 


requiring that D% f € W'?(O; R?), which is satisfied by the assumption f € 
W**2.?(O:; R?). By summing the second and third inner products and using (54), 
this becomes 


( D Boone's, D” By f, D" f)+{D" By f, B D° f+ > Bpawg D" f--ByD* f) 


a’<a a! «a 
which we look to simplify by combining BE and Bg , noting that 
B? + B; =L} +T + Ly + T= Te + Fi. 


Indeed we arrive at the expression 


( x Hyg D” Bah, D* f) - (D* By f. (7& TD b Bp Dy). 


a' <a a! «a 
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As we are looking to achieve control with respect to the W*:7(O; R3) norm of f, 
then it is the terms with differential operators of order greater than k that concern 
us. Of course this was the motivating factor behind combining Bg and its adjoint, 
nullifying the additional derivative coming from £g. There are more higher order 
terms to go though, and the strategy will be to write these in terms of commutators 
with a differential operator of controllable order. This will involve considering 
different aspects of our sum in tandem, which will be helped with (54) reducing 
our expression again to 


( 27 B peu, D" By f, D? f) + ( 2- Bp«-5g D^ f + Bg, D^ f, (Te; T T;;) D* f 


a’ <a B<a 


+ 25 Boeg D“ f) 


a! «a 


Ultimately the terms in the summand are split up into 


(Bg, D® f, (Tg + Tg) D? f) (56) 

H 2. Bpe-£g D? f, (Ta, + TE)D* f + b» B pae, D* f) 99 
B «o a! «a 

+) (Orca D” Bg, f, D* f) + (Bg, D" f, By, ug, p" f) (58) 


with the intention of controlling each one individually. Firstly for a treatment of 
(56), 


(56) = (a, + T) D* f. (T + Ta) D* f) 
= (Ly D" f, Tg D* f) + (Li D* f, Ty D* f) 
+ (Tg, D* f, Tg D* f) + (Tg D* f, Tg D* f) 
= (Te Li D* f, D° f) + (£s, D* f, Tg D" f)) 
+ ((7ZD" f, D* f) + (Ta D* f, Te D" f). 


We now bound the brackets in terms of || D^ f |? separately, starting with the latter 
one as 


(T£ D* f, D® f) < IT D* FID fli < cliéillwioellTg D* FIND® fl 


2 2 
< clli llis D F 


and similarly 
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(Tg, D" f, Tg D* f) < Te, D" FINTE D" FI < clés ll D® FIP’. 


Now for the first bracket, we add and subtract a term to have an expression through 
the commutator of the operators: 


Ug C D? f, D" f) + (Le, D" f, Tg D" f) 

(Cg, Cg, — Cg, Tg) D f, D" f) + (Cg Te, D" f, D" f) + (Cg D* f, Tg D" f) 
(Te Le — £g TE) D" f, D" fy + (Te; D* f, £E D" T) + (Gg D* f, Te D* f) 
—(U Le, — £a Tg) D* f, D^ f). 


The commutator term is given explicitly through 


Tyly D* f = n(o") 


and 
3 
Le Tu D" f = Le,( X D“ f*vi) 
k=1 
3 3 
=) 105 D" f" v& ) 
j=l k=1 
3 3 i 
= S 8/9; (D* f VEF) 
j=l k=1 
a 3 . 
E AGE D* f* vg + EÏ D° f*ajVE ) 
j=l k=1 
such that 


3 3 
(Te, £g — Le, Ta) D” f = — dds J p" faj ver. 
j= 1 k=1 


On the 3D Navier-Stokes Equations with Stochastic Lie Transport 87 


Therefore 


$3. 3 


Il (Tg £g — £g TE) D fl € cV Dlg! D? f*a; vs 


j=1 k=1 


< RAR" D* fta jatt l0 


3 3 3 
=] k=1 l=1 


359 ll ajat leom ID f* lom 


3 3 3 
< cll lizo X 3. YID" flo 
j= 


1 k=1 l=1 

< clé; lyx llD* f l| 

and 
(TaLe — Le Tg) D" f, D" f) < Tg Li — Li Tg) D" f IID” f 
< cli&ill 2s. ID" f IP". 

Combining these inequalities we determine the bound 

(56) < clli lyzl D* f. 
As for (57) we look to use Cauchy-Schwarz and bound each item in the inner 


product. Indeed straight from (18) in the L? (O; R?) setting, by simply replacing 
& with D*-P £j, we have that 


lB pe-p, D? fI? < cll D* PE Eio ID f louis < elle eise llf Moa 


Moreover, 


| J B» p^] < c | Bps-sg DP fI? < chéill f Ipsa. 
B<a B<a 
In addition to this, 


l(7& + Tg) D" fll S IT D" FIL + TG D" fl S cllEill wero D" fl 


using the equivalence in operator norm of the adjoint. Together this amounts to 
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60 «Y: gon or - trs e ror E most F| 
psg a! <a | 
< || J Bore, p^ r (vcr TO)" fl | Y Bpa- p" j|) 
PR a! «a 


E cli Mio lf Mo (elg; Iwil D* f + cli eet fllwe2) 


2 2 
< ellés liyt, Flee: 


Let's now turn our attentions to (58), which for each a’ in the sum we rewrite as 
(D? f, B pa-e'g, De Bs, f + BE B pa-e'g, D* f) (59) 
and employing (54) again we see this becomes 
p" f Bes 2 Bowe, D? f + Bẹ D* f) + Bg, B pa-w, D" f) 
p «o! 


~ n". 2B Bow- D^ f] 


B<a’ 
+ (OD? f, Bpa-a'¢, Bg, D” f + BE Bp; D” f). 
We have split up these terms to make our approach clearer, as the two right hand 


sides of the inner products will be considered separately. For the first inner product, 
two applications of (18) give that 


IB pawg Bpa- D^ fI? < clD*7 502, ul Bps DP Fra 
«entes iu (ci? Em ID flos) 
< ellé: liye llf Moi 


Moreover, 


2 
x el Bysculs yas DP FI? 


| 2 | Bee's, Bpe-s D^ f 


B<a’ 
4 2 
< ellé ll wise ll f yia 


As for the second inner product, we rewrite the right side as 


B po-o' (Le, RE T&)D*. f) s (Le, + TE) B pa-a'g, Df 
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and further 


(Bps-e';, Le, — £g B pa-a' g, ) D" f+ Bpo-a'¢, Tt, D” f+ TE B pa-a' ¢, D” f. 


Starting with the latter two terms, 


lB po-o Tg D* fl? < cll D°~* ki liyi lTi D® Fiy 


and likewise 


—ü 2 2 : 2 
< e| D*77 & lyr. (ella lD” fla) 


4 2 
< chlé liyer lf ly 


ITË Bps-eg D" fI? < clés pee; D® fll? 


2 —a'! 2 4 2 
< eli yro (cll D7 E: Nyel D” Flys) 


4 2 
< ell&i lyer ll filer: 


Now we show explicitly that the commutator in 


E Ug Bei) 7 


89 


(60) 


(61) 


from (60) is of first order (so of k™ order when composed with D’), through the 


expressions 


B pa-e'g, Li, D” f= 


and 


3 (os o;( 3 &'oD"' f) 


j= k=1 


4 ( y eka D™ DEZ ) 


k=1 


= 


3 3 , 
-5 (Deaan s 


+ De El £39, D" f + Eka, D™ f) v DY gl ) 


3 3 


Lg Byo-a'z, D^ f= 2.8 aX o"- Uta DY f+ p9 FIV De E; ) 


= pel 
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3 3 
EE (sans 9; D" f + EED Ei 9,9; D" f 


+ Ek ak p* fiyp— g Es epe fi avo= gi) 


such that 


3 
6D=>) 
j=1k 


3 
C 3,5 3, D" f — &a, D"-" &aj D f 
1 


= gp" fi xvn) 


and in particular 


5.3 
IéDI se» >, (io £/ 0;5 ODY f|? + NEED Ea; D" f|? 


1 k=1 


- 


+ lef D" fiav De“ gi i") 


E 3 
1220302 (ioe 5/ 8j5 0 D" Tow 


jal k=1 i21 


ee a _ E 
+ EKD £8; D" f' 525. + EF D® f 9,8 D^" E xoa) 
3 


3 
< clli lyze YD ». (iios flim) 


j=l k=1 l=1 
! el 2 ! el i2 
T 8; D” f l'r2(0:8 S ID” f Ia, 
3 3 3 3 
4 2 2 ! e2 
< elé Iyer ( Ina YI + 1 Dodo" fi ) 
j=l k=1 j=l k=l 
4 2 
< clli yezo llf y2: 


Finally now we can piece these four inequalities together to produce a bound on 
(59): 
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\(59)| < I>" ni * Fae Byun, DPF + (60) 


B<a’ 


< i> n 3. Bowe, Bpe-eg D d + IB pag, Te D" f| 


B «a' 


+ ITE B pawg, D” fl [2 
< CAD" ritis tn wi + El roe lf Mia 


+ léi lioll fllwe2 + lisse Iii) 


2 
EXTITERIT 
2 2 
< elgi lyzl flea 


and subsequently of (58): 
68)= 35 69 <c V Een se M f My < clé zol fye 
a' «a a! «a 
We can now conclude the proof of (19) by observing that 
(52)= ) 63) 
|a| xk 


= $;69- 60-69 


a|xk 


<c) (iiio nr + ME liyer llf l yez + les aa) 


a|xk 


2 2 
Se eller lye 


a|<k 


2 2 
=e Ei lyro fll qye.2- 


As for (20), using (54) once more, we see that for each o in the sum for the inner 
product, 


KD" B; f, D* f = || X Bp D" f + Ba D* f, Df 


a «o 
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= ( S Beg" 7, D” f) + (Be, D* f, D* f) 


a «o 


IA 


2s Bway, D" f, D” f) + I(T; D” f, D“ f) 


a’ «o 


using the cancellation from (15) to dispose of the order k + 1 term. In our treatment 
of (57) in (19), we showed the bound 


| 2 Bpo-ig DP f 


B<a 


< ellé; ll wise llf wi 


and therefore 
(E Bose DY f, D^ f)| < elgi lwil wez" FI 
a' «a 


2 
< ellé; lwe llf yi 


whilst a simple bound on the second term yields the result. o 


Proof of Lemma 2.7 For Vg € L21 (0; R3) N Ww'?(0; R?), 


Bi (V8) = Le (Vg) + T (Va) 


3 3 
=) gavo +} ajgvEj 


j=l j=l 

3 3 . 

= M g(vajg + X (VE! )a;8 
j=l 


j=l 


3 
=y A Rs 
j=l 

2,4 (9. R3 

€ Lg- (0; R^) 
using in the last line the assumption that Vg € WP'?(O; R?) and that & € 
w!-°(O; R?). We now make a distinction between the settings of T? and @. For 
the bounded domain @, we take any f € W?(6; R?) and use the representation 


(9) to see that 


PBif = PBi (Pf - Vg) PBiPf -P(BiVg) 2PBiPf 
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as required, using again the W!?(@; RÌ) regularity of both components of the 
decomposition (9). In the case of the Torus we must address the constant term in 
the decomposition (10), appreciating that 


3 3 
Bic = Tgc = » cl VE! =V ser € Lom. R?) 
j=l j=l 
so the result follows in the same manner. a 


We now state two results used in proving the Lemmas of Sects. 3.2 and 4.3. The 
first is derived from the Gagliardo-Nirenberg Inequality, whilst the second is proved 
below. 


Proposition 5.1 There exists a constant c such that for any f € W}:?(T?; R?) and 
g € W2^ (T5; R°), 


1/2 1/2 
lC rell < ell float igi. (62) 
Meanwhile for f € W1?(6; R5) and g € W2?(6; R3) we have that 


lssi < eli flh (Ie eli Iles). (63) 


Proposition 5.2. There exists a constant c such that for every f € W*?(O; R9), 


[A, BISI? < céill 


where [A, Bi] is the commutator 
[A, Bj] := AB; — B; A. 


Proof We fix any such f and first show that 


3 3 
[A, Bilf = Y 9 (okela f 26] 3.0, f + 2 fH vi] + F/aRVE!) 


k=1 j=1 
(64) 
Indeed 
3 3 . . . 
ABif 2 Y | (89) + f! vgl) 
k=1 j=l 


3 3 
- 9. | 3 (asl) + $0; f of vi + fave) 
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3 3 
=J) (Blaj f+ 2.6/0; f + &/ Ba; p + od ri vl 


+2 f AVE] + fadi) 


and 
3 3 
voee) 
j=l = 
2 3 . 
- Y Y (gsto;r + a2 rival) 
k=1 j=1 
therefore 


[A, Bil f = ABif — Bi Af 
23 (asi 9; f + 20,67 8,9; f -- 20, // 9, VE] + f? alve) 


justifying (64). The result then follows with direct calculation: 


2 


3 o3 
ITA, BAS I? | 9 27 (asi 9; f 20,6] 9,8; f + 28k f^ 9, V&] +f! avg) 


k=1 j=) 


3 3 
<c) D [ace 0j f 28,87 9; f £23 flavel + Pavel | 


3 3 
=D YOY aeaf «22/00; f + 29 P) 036 


k=1 j=1 l=1 


+ piazae! |. 


L?(O;R) 


3 3 3 
TPSPSPSICCCDTEPPRENPCTDTU t 


k=1 j=1 /=1 


+ 122, f 8.818! om + £7 978157 Daem) 
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3 3 3 
sAile DDD, (08; f hom + 189) f ow 


k 1 j=1 l=1 
+ lak f N? HIFI 
KJ” ! T2(0:R) L?(O;R) 
2 2 
< cli Nys Fg 


Oo 


We now prove Lemma 3.7 through the properties (26), (27) and (28) indepen- 
dently. 


Proof of (26) We use the established result that for k = 2 then the Sobolev Space 
W*?(O; R) is an algebra (a result first presented in [46]), to deduce that 


3 
Lp, fallw22 =| 0 fi 8j fa < cll fallall falls. 


j=l w22 


From here we simply use that P, is self-adjoint and Young’s Inequality to see that 


KPaPL s, fas fn)2| = PL f fas fn)2| < CONF + ell falls. 


Proof of (27) As the P, are self-adjoint we can readily justify the inequality 
(PaP B? fas faji + (PaP Bi fas PaP Bi fu) < (PB; fus fa) + (PBi fa. PBi fn) 
and moreover from (12) that this is just 

(PB? fu. Afa) + (PB) fn, APB; fn). 
We rewrite this as 

(P Bi? fa, Afn) + (P Bi fn, ABi fn) 
and further as 

(PBi fn, Bi Afn) — (P Bi fn, ABi fn) (65) 
for the adjoint B¥ = £g + T We look to commute the Laplacian and B;, using 


Proposition 5.2 and subsequently the cancellation of the derivative in B; when 
considered with the adjoint. Indeed, 
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(P Bi fn, A Bi fn) == P Bi fn, ([A, Bi] + BiA) fn) 
P Bi fn, [A, Bi] fn) = (P Bi fn, P Bi Af) 
P Bi fn. LA, Bi] fn) + (P Bi fn, P Bi Afn) 


( 
( 
( 
(P Bi fn, LA, Bil fn) + (P Bi fn, Bi Afn). 


Thus (65) becomes 
(PBi fn, B Afn) — (P Bi fas U^, Bil fa) + (P Bi fs, BiAfn) 
or simply 
(PBi fn, (T + TE)Afn — [A, Bil fn) (66) 


which we look to bound through Cauchy-Schwarz and the results of (18) and 
Proposition 5.2 to see that 


(66) < IPB; full (Tg + TEVA fall + IA, Bil full) 
< clé lwi folla (Milwr LA fall + Ii hse fullw22) 
< cl lwi full (Illis Falla + Hall fll) 
< CCE) Néi liya lfa li + eli lgl fa o 


as required. o 


Proof of (28) As with (27) we can immediately say that 


(PaP B2 fn, fa)2 + (PaP Bi fa, PaP Bi fa)2 
< (PB? fa, fn)2 + (P Bi fn, P Bi fn)2 (67) 


which we again manipulate to give 


(67) = (AP B? fa, Afa) + (AP Bi fn, AP Bi fn) 
= —(PAB? fn, Afa) — (ABi fn, PAB; Jn) 
= —(P[A, Bi]Bi fn + PBiABi fn, Afa) — (ABi fn, PIA, Bil fa + P Bi fn) 
= —(P[A, Bi]Bi fn, Afa) + (PBiABi fa, Afa) — (ABi fa, PIA, Bil fa) 
+ (AB; fn, P Bi Afa) 
= (B; AB; fn, Afn) + (ABi fa, Bi Afa) — (PLA, Bi] Bi fa, Afa) 
— (AB; fn, PIA, Bil fn) 


( 
( 
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= (AB; fn, (Bi + B*)A fn) = (PIA, Bi] Bi fn, Afn) = (A Bi fn, PIA, Bil fn) 
= (AB; fn, (Ti, + TEDA fn) — (PIA, Bi) Bi fn, Afn) — (A Bi fn, PIA, Bil fn). 


We shall treat each term individually using Cauchy-Schwarz, Young’s Inequality 
and Proposition 5.2 in the same manner as the proof of (27): 


(AB; fs, (Te, + TEDA fn) < WAB: fall ll Tg, + Te) Aa 


€ 
< CCE sll Fall + 5 Mi Mg fa l3 
as well as 


— (PIA, Bi] Bi fn, Afn) < IPTA, Bi) Bi fll | Afnl 
< ¢ll[A, Bi1Bi fall full 
< c|l&ilwsoe | Bi fn 22 Il fall 


2 
< ellés lyy. ll fn ll ws M fall 


€ 
< CCE yscoll Falla + 5 NSi yso fn l3 
and finally 


— (AB; fa, PIA, Bil fa) < IIA Bi fallll PLA, Bil fall 
< cll Bi fall w22llA, Bil full 
< elgi lys. Il full 32 fall w22 
E 
< cli Ms Ml fl + Mi ys. l fal 


Summing these up completes the proof. o 


Proof of Lemma 3.8 Observe that 


(PLy f — P£,g, f - gh = (PLf f — Leg, ACF — 8) 
= (Ly-g f + £_(f — 8), ACÉ — 8) 


and so it is sufficient to control the terms 


IE paf, ACF — 8))| < LC g- FIAC — lI 


< c) FIBI — slit + Sf — el 


and 
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|(Le(f — 8), A — 8))| < ILe(F — eUIACE — 2) 


1/2 1/2 
< eleli f — el^ WF — ell Wf — llo 


€ 
< c)llelftllf — sli + 51 — alb 


using (62) and Young's Inequality with conjugate exponents 4 and 4/3. o 


Proof of Lemma 3.9 As in Lemma 3.8, we use the inequality 


KPLff —PLg8, f= 8| < Lrf f= lH F e)l. 


For the first term, appealing to (4), observe that 


Ls- f, f — 9| < Es- fF — gl < COIFIGIS — el + elf — gii- 


The second term is null due to (15), which concludes the proof. oO 
Proof of Lemma 4.4 We consider the proofs individually. 


Assumption (45): We look to use the same approach as the one for (19), and given 
the similar structure of .Z; to B; we provide only a sketch of the proof addressing 
the key properties. The essential component of the proof is in removing the 
derivative dependency coming from the transport type operator. Inspecting the 
proof of (19), there are three key properties that should be shown. 


. £* + . Zi is of zeroth order. 

. Defining Q; by Q; : f — ZCjéi,then the commutator [Q;, £g] is of zeroth 
order. 

3. For |æ| < 1, the commutator [-Zpeg; , Lg; ] is of first order. 


Noe 


We note that property 3 can be shown for arbitrary œ if we assume sufficient 
regularity for &;, as we did in Proposition 2.6. Property 1 is clear from the same 
structure Z7" = —£g + Q7? where Qj, O7 are of zeroth order. We calculate the 
commutator in 2 explicitly, acting on f € W!?(G@; RÌ): 


k 
3 3 3 
Qi£& f =Y (Cf) && = Y | Y 89;f | ae 


k=1 k=1 \j=l 


3 3 


-Y Y gafas 


k=1 j=1 


and 
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3 : 3 
Le, Qif = Y £a; (Qi f) = » H (x fas) 
j=l =1 k=1 
X d o 
=X J 8l (ajf akti + f jakti) 


hence 


3 3 
(Qi, £4] 2 — 0 9,6] f 8j8& 


j=1 k=1 


which is of zeroth order. As for 3, the term which needs to be addressed is the one 
[Lpes;, £g] which was already attended to in the original proof, so the result is 
concluded here. 


Assumption (46): Comparing to the proof of (20), this is a consequence of the 
property (15) once more and the same boundedness of (18). 
Assumption (47): Once more the critical term is [A, £z ] which was addressed in 
the proof of Proposition 5.2. 
Oo 


Proof of Lemma 4.6 Note that 


|(PrPLBS pb, Pn; $n)1 | = (Ps P Z5, by Pn; Apn) | = CCENN-LBS gy Pn I? + Ellon IE 


and 
lZasco dul < 2 (Insee nll” + ILo, BSx nll”) 


so we look to control these two terms. Indeed, 


3 3 
j2 k2 

lLBsko dul < c 9 J BST ecr lAN oR 

j=l k= 

3 3 


gni 


€ c3 Y IBSkdi yog) 19; P F202) 


PIE 
2 2 
= K Qn 2,2 n 1,2 
< cll BSx Gn l2y22llball2, 
4 
< clloallyi2 


4 
< clien] 


j= 
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using the Sobolev Embedding of W??2(0;R) —> L™©(@;R) and item (3) of 
Theorem 4.2. Likewise observe that 


j 2 ky2 
DA ZA lO, BSk o, ICA 


D 
E 
x% 
> 
E. 
n 
eo 
Me 
Me 


3 
jy) 2 k2 
<c CARE 6-R) 9; BSk ball wi2e-R 
(@;R) (@;R) 


2 2 

< ellgallyi2ll B Sk bnllye22 
4 

zs clon lly. 


4 
< clin] 


Summing these terms completes the proof. o 


Proof of Lemma 4.7 We write out the left hand side in full: 


|P ZeBsreb —PLaswv.o—w)| 
= |(LgBskob — LoBSko — Lesky V + Ly BSkv.o—)| 
= | nsgo-Bsgu d + Casgu(Ó — V) — Co-y BSkó 
—Ly (BSkó — BSK Y), — v)| 


from which we shall split up the terms and control them individually. Firstly, 


lC Bsko-Bsyvó. $ — V)| < IC asco-Bscvélllló — vii 
< c|BSk ($ — Vllllellilió — vili 
< cl — vilhilleliilié — viii 


< cello - WIP + $g- vill 
using (4) and that [B Sk ó — BSxw](x) = fe K(x, y)ó — v]CGo)dy is the solution 
specified by B Sg (@ — y) in Theorem 4.2 for @ — y. Even more directly we have 
that 
(Lesky lh — V). 6 — y) 20 


owing to (15), and for the final two terms the bounds 


|Za- y BSkó. $ — V)| < cll — WII BSxdllall¢ — vill 
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< c()leéllé — vi? + ;ló - yl 


and 
|Ly(BSKo — BSKY), $ — V)| < cly lhillBSKo — BSk ylle — vili 
2 jq 2 
< eyle = vI + 316 = vh. 
Summing these terms concludes the proof. o 


5.2 A Conversion from Stratonovich to Itó 


This theory is taken from [30] Subsections 2.2 and 2.3, and is provided here for 
simplicity to apply in Sect. 3.3. We work with a quartet of embedded Hilbert Spaces 


V= He Ux 


where the embedding is assumed as a continuous linear injection. We start from an 
SPDE 


t t 
V, =o + f Quas + f GW, o dW;. (68) 
0 0 


where the mappings Q, G satisfy the following conditions, with the general operator 


K : H — R defined by 


K(9) = c (1+ lol? + IIT) 


for any constants c, p, q independent of $. 


Assumption 5.3 Q : V — U is measurable and for any $ € V, 
Qu < KU + Ill]. 
Assumption 5.4 G is understood as a measurable mapping 
G: V > S? (st; H), G: H > Z (s; U), G : U > Z4; X) 
defined over 4 by its action on the basis vectors 


GC, ei) = Gi C). 
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In addition each G; is linear and there exists constants cj such that for all € V, 
vVeH,neU: 


oo 
2 
lgiélla < cilléllv. lgivllu < cillVlla. WGinlx < cillnllu. JO e? < oo. 
i=l 


In this setting, we have the following result ([30] Theorem 2.3.1 and Corollary 
2.3.1.1). 


Theorem 5.5 Suppose that (W, 1) are such that: t is a IP — a.s. positive stopping 
time and W is a process whereby for IP — a.e. œw, V.(w) € C([0, T]; H) and 
V. (o)1.-r(5) € L? ([0, T ]; V)forall T > 0 with W.1.—. progressively measurable 
in V, and moreover satisfy the identity 


tAT 1 oo [^T 
V, = V, +f Q-4- GF Yds +f GW dW, 
0 = 0 


IP — a.s. in U for all t > 0. Then the pair (W, x) satisfies the identity 


tAT tAT 
V, = Yo + f QW, ds + f GW, o dW, 
0 0 


P — a.s. in X for all t > 0. 


The mapping 1 bw G? is understood as a pointwise limit, which is justified in 
[30] Subsection 2.3. 


Remark 6 Practically, Theorem 5.5 provides an It6 equation from a Stratonovich 
one in the sense that solving this It6 equation is sufficient to satisfy the identity 
in Stratonovich form. To discuss an equivalence between the equations we would 
need to formally define a solution of the Stratonovich equation, which we do not do 
here. To make sense of the Stratonovich integral one would have to impose that the 
solution is a local semimartingale in U, in which case the two notions are genuinely 
equivalent. 


5.3 Abstract Solution Criterion I 


The result is given in the context of an Itó SPDE 


t t 
V, = Yo +f A(s, Ys)ds +f G(s, V .,)dW;. (69) 
0 0 


We state the assumptions for a triplet of embedded Hilbert Spaces 
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Vo HoU 


and ask that there is a continuous bilinear form (-,-)yxy : U x V — R such that 
for $ € H and y c V, 


($, V)uxv = (6. V)g. (70) 


The mappings A, G are such that for any T > 0, A: [OL T] x V > U,G: 
[0, T] x V —> Z? (4; H) are measurable. We assume that V is dense in H which 
is dense in U. 


Assumption 5.6 There exists a system (ay) of elements of V such that, defining the 
spaces V, :— span {a1, ..., an} and Py, as the orthogonal projection to V, in U, 
then: 


1. There exists some constant c independent of n such that for all € H, 


IP,el2 < clol. (71) 


2. There exists a real valued sequence (un) with [ty — oo such that for any $ € H, 


1 
I — Plu < "iL (72) 


where I represents the identity operator in U. 


These conditions are of course supplemented by a series of assumptions on the 
mappings. We shall use general notation c; to represent a function c. : [0, o0) > R 
bounded on [0, T] for any T > O0, evaluated at the time t. Moreover we define 
functions K, K relative to some non-negative constants p, p, q, q. We use a generic 
notation to define the functions K : U —> R, K :U x U > R, K : H —> Rand 
K:H xH > Rby 


K($) :=1 + ll, K($, y) :=1 ++ lol? +yli, 
Ř ($) := K($) + lol, Ko. y) := K($, V) + loll? + WIG 


Distinct use of the function K will depend on different constants but in no 
meaningful way in our applications, hence no explicit reference to them shall be 
made. In the case of K, when P, q = 2 then we shall denote the general K by Ka. 
In this case no further assumptions are made on the p, q. That is, K2 has the general 
representation 


Ralo, Y) = KG, V) + lll; + Wy (13) 


and similarly as a function of one variable. 
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We state the subsequent assumptions for arbitrary elements $, y € V, $" € Vn, 
n € H and t € [0, oo), and a fixed x > 0. Understanding G as a mappingG : 
[0, co) x V x U — H, we introduce the notation C; (^, -) := G(-, -, ej). 


Assumption 5.7 
AG DI + FIGE DI = aKO) [redes]. (14) 
iz 
IAG, 6) — AG, Wl < e [KG V) + lolly + Ili | lo — vili 
(75) 
3121 $) — Gi (t. Wily < eK. Wild — Wily. (76) 
iz 
Assumption 5.8 
2(PrA(t, $^), d") + Y ine. Ni < «R0 [1 + 1^1 | - teni. 


i=1 
(77) 


SO (Pagi, 6"), 9")7 < c Kalo") L + CAIA . (78) 
i=l 
Assumption 5.9 
UAG, 6) — Alt, V). 6 — Wu + DONG. 4) — Gilt. Wil 


i=l 


< & Kalo, We — VIT, — klló — Ylli, 


(19) 
)»»7 (t. 6) — Gilt, Y), 6 — Wy < aKx($. Wild — vlc (80) 
il 
Assumption 5.10 
UAM, 0). )u + Y i6. DI <crK) [1+ loli], (81) 


i=l 


LG. Hh = OKO) [1+ Ist | (82) 
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(AC, p) — AG, Y), nu Sa + Ini [K@, V) + lléllv + llrlvilié — yla. 
(83) 


With these assumptions in place we state the relevant definitions and results, 
first announced in [31] and proven in [32]. Definition 5.12 is stated for an Fo— 
measurable Wo : Q — H. 


Definition 5.12 A pair (V, t) where t is a P — a.s. positive stopping time and VW 
is a process such that for P — a.e. w, V.(w) € C ([0, T]; H) and Y. (w)l.<t(%) € 
L?([0, T]; V) for all T > 0 with V .1.—, progressively measurable in V, is said to 
be an H-valued local strong solution of the Eq. (69) if the identity 


[AT [^T 
V, = Yo + f A(s, Us)ds +f G(s, YAW, (84) 
0 0 


holds IP — a.s. in U for all t > 0. 


Definition 5.13 A pair (W, ©) such that there exists a sequence of stopping times 
(0;) which are P — a.s. monotone increasing and convergent to ©, whereby 
(Wong js 0j) is a V —valued local strong solution of the Eq. (69) for each j, is said to 
be an H —valued maximal strong solution of the Eq. (69) if for any other pair (®, T) 
with this property then © < T P — a.s. implies © =F P — a.s.. 


Definition 5.14 An H —valued maximal strong solution (W, ©) of the equation 
(69) is said to be unique if for any other such solution (6, I), then © =T P — a.s. 
and for all t € [0, ©), 


P ((o € Q : V, (c) = ,(»))) = 1. 


We now state the main theorem in this setting. 


Theorem 5.15 Suppose that Assumptions 5.6—5.11 are satisfied in this framework. 
Then for any given Fo— measurable Wo : Q — H, there exists a unique H —valued 
maximal strong solution (W, ©) of the equation (69). Moreover at P — a.e. w for 
which ©(@) < oo, we have that 


O(w) 
sup ||W,-()|lz, +f IY, (o); dr = oo. (85) 
re[0,6(o)) 0 


Proof See [32] Theorem 3.15. oO 
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5.4 Abstract Solution Criterion II 


We extend the framework of Sect. 5.3, introducing now another Hilbert Space X 
which is such that U — X. We ask that there is a continuous bilinear form 
(-,-)xxH : X x H — R such that for o € U and y € A, 


(6. V)xxnu = (Q, V)v. (86) 


Moreover it is now necessary that the system (a,) from Assumption 5.6 forms 
an orthogonal basis of U. We state the remaining assumptions now for arbitrary 
elements $, y € H and t € [0, co), and continue to use the c, K, K , k notation of 
Assumption Set 1. We now further assume that for any T > 0, A : [0, T] x H > X 
and G : [0, T] x H > .Z? (U; U) are measurable. 


Assumption 5.16 
IAC, Blk + Y "Idi DI < e KE [1 1015 . (87) 
i-l 
A(t, 6) — A(t, VO lx < e EK QS IO) + lella + Illa M — villa 
(88) 
Assumption 5.17 
2(AQ. 9) — Alt, Y), $ — V)x 
+ SONG, 6) — Gilt, VII x cK, Wd — WI, (89) 
i=1 
Di Gilt, 0) — Gilt, Y), 6 — V) < c K2(G, Wild — Wilk (90) 
i=l 
Assumption 5.18 For every ¢ € V, it holds that 
UAE, $), pu + Y Gi Gs Bila < cK (D) — Kl bly, (91) 
i-l 
2 Gilt, $), d) < c K(9). (92) 


Remark 7 Note that Assumption 5.18 is stronger than Assumption 5.10, as we are 
bounding the same terms but we are not afforded a control in the H norm of $ 
in addition to its U norm. Thus in applying Theorem 5.22 it is sufficient to only 
demonstrate Assumption 5.18. 
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Analagously to Susbection 5.3, we state the relevant definitions and the resulting 
theorem in this context (again proved in [32]). Definition 5.19 is stated for an Fo— 
measurable V : Q — U. 


Definition 5.19 A pair (V, t) where t is a P — a.s. positive stopping time and V 
is a process such that for P — a.e. œw, Y.(w) € C ([0, T]; U) and Y. (w)l.<t(œ) € 
L? ([0, T]; H) for all T > 0 with V .1.-. progressively measurable in H, is said to 
be a U-valued local strong solution of the Eq. (69) if the identity 


[AT [AT 
V, = Yo + f A(s, Vs)ds +f G(s, Ws)dW, (93) 
0 0 


holds P — a.s. in X for all t > 0. 


Definition 5.20 A pair (V, ©) such that there exists a sequence of stopping times 
(0;) which are P — a.s. monotone increasing and convergent to ©, whereby 
(Wong js 0j) is a U —valued local strong solution of the Eq. (69) for each j, is said to 
be an H —valued maximal strong solution of the Eq. (69) if for any other pair (®, T) 
with this property then © < F P — a.s. implies © =T P — a.s.. 


Definition 5.21 A U —valued maximal strong solution (V, ©) of the Eq. (69) is 
said to be unique if for any other such solution (9, I"), then © = T P — a.s. and for 
all t € [0, 6), 


P(w € Q: P(w) = ®;(@)}) = I. 


Theorem 5.22 Suppose that Assumptions 5.6—5.11 and 5.16—5.18 are satisfied in 
this framework. Then for any given Fy— measurable Wo : Q — U, there exists 
a unique U —valued maximal strong solution (W, ©) of the Eq. (69). Moreover at 
P — a.e. w for which O (w) < oo, we have that 


O(a) 
sup |W, (@) + T V, (o)lz dr = oo. (94) 
re[0,O(@)) 0 
Proof See [32] Theorem 4.9. oO 


Acknowledgments Daniel Goodair was supported by the Engineering and Physical Sciences 
Research Council (EPSCR) Project 2478902. Dan Crisan was partially supported by the European 
Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation 
Programme (ERC, Grant Agreement No 856408). 

We would like to express our sincere gratitude to the anonymous reviewers of this paper, whose 
detailed and insightful comments have greatly contributed to this publication. 


108 D. Goodair and D. Crisan 


References 


[1] Agresti, A. and Veraar, M., 2021. Stochastic Navier-Stokes equations for turbulent flows in 
critical spaces. arXiv preprint arXiv:2107.03953. 

[2] Alonso-Orán, D. and Bethencourt de León, A., 2020. On the well-posedness of stochastic 
Boussinesq equations with transport noise. Journal of Nonlinear Science, 30(1), pp.175—224. 

[3] Attanasio, S. and Flandoli, F., 2011. Renormalized solutions for stochastic transport equations 
and the regularization by bilinear multiplicative noise. Communications in Partial Differential 
Equations, 36(8), pp.1455-1474. 

[4] Bensoussan, A. and Temam, R., 1973. Equations stochastiques du type Navier-Stokes. Journal 
of Functional Analysis, 13(2), pp.195-222. 

[5] Bethencourt-de-León, A. and Takao, S., 2022. Transport noise restores uniqueness and 
prevents blow-up in geometric transport equations. arXiv preprint arXiv:2211.14695. 

[6] Brzeźniak, Z., Capifiski, M. and Flandoli, F., 1992. Stochastic Navier-Stokes equations with 
multiplicative noise. Stochastic Analysis and Applications, 10(5), pp.523-532. 

[7] Brzezniak, Z. and Peszat, S., 1999. Strong local and global solutions for stochastic Navier- 
Stokes equations. Infinite dimensional stochastic analysis, pp.85-98. 

[8] BrzeZniak, Z. and Slavik, J., 2021. Well-posedness of the 3D stochastic primitive equations 
with multiplicative and transport noise. Journal of Differential Equations, 296, pp.617—676. 

[9] Constantin, P. and Foias, C., 2020. Navier-stokes equations. University of Chicago Press. 

[10] Cotter, C., Crisan, D., Holm, D., Pan, W. and Shevchenko, I., 2020. Data assimilation for 
a quasi-geostrophic model with circulation-preserving stochastic transport noise. Journal of 
Statistical Physics, 179(5), pp.1186—1221. 

[11] Cotter, C., Crisan, D., Holm, D., Pan, W. and Shevchenko, I., 2020. Modelling uncertainty 
using stochastic transport noise in a 2-layer quasi-geostrophic model. Foundations of Data 
Science, 2(2), p.173. 

[12] Cotter, C., Crisan, D., Holm, D.D., Pan, W. and Shevchenko, I., 2019. Numerically modeling 
stochastic Lie transport in fluid dynamics. Multiscale Modeling and Simulation, 17(1), 
pp.192-232. 

[13] Crisan, D., Flandoli, F. and Holm, D.D., 2019. Solution properties of a 3D stochastic Euler 
fluid equation. Journal of Nonlinear Science, 29(3), pp.813—870. 

[14] Crisan, D. and Mensah, P.R., 2022. Spatial analyticity and exponential decay of Fourier modes 
for the stochastic Navier-Stokes equation. arXiv preprint arXiv:2209.14862. 

[15] Crisan, D. and Lang, O., 2020. Local well-posedness for the great lake equation with transport 
noise. arXiv preprint arXiv:2003.03357. 

[16] Crisan, D. and Lang, O., 2023. Well-posedness properties for a stochastic rotating shallow 
water model. Journal of Dynamics and Differential Equations, pp.1—31. 

[17] Da Prato, G. and Debussche, A., 2003. Ergodicity for the 3D stochastic Navier-Stokes 
equations. Journal de mathématiques pures et appliquées, 82(8), pp.877—947. 

[18] Debussche, A., 2013. Ergodicity results for the stochastic Navier-Stokes equations: an 
introduction. In Topics in mathematical fluid mechanics (pp. 23-108). Springer, Berlin, 
Heidelberg. 

[19] Debussche, A., Hug, B. and Mémin, E., 2022. A consistent stochastic large-scale representa- 
tion of the Navier-Stokes equations. arXiv preprint arXiv:2207.07472. 

[20] Enciso, A., Garcia-Ferrero, M.A. and Peralta-Salas, D., 2018. The Biot-Savart operator of a 
bounded domain. Journal de Mathématiques Pures et Appliquées, 119, pp.85-113. 

[21] Flandoli, F., 2013. Topics on regularization by noise. Lecture Notes, University of Pisa. 

[22] Flandoli, F., 1994. Dissipativity and invariant measures for stochastic Navier-Stokes equa- 
tions. Nonlinear Differential Equations and Applications NoDEA, 1(4), pp.403-423. 

[23] Flandoli, F., Galeati, L. and Luo, D., 2021. Delayed blow-up by transport noise. Communica- 
tions in Partial Differential Equations, 46(9), pp.1757-1788. 

[24] Flandoli, F. and Gatarek, D., 1995. Martingale and stationary solutions for stochastic Navier- 
Stokes equations. Probability Theory and Related Fields, 102(3), pp.367—391. 


On the 3D Navier-Stokes Equations with Stochastic Lie Transport 109 


[25] Flandoli, F. and Luo, D., 2021. High mode transport noise improves vorticity blow-up control 
in 3D Navier-Stokes equations. Probability Theory and Related Fields, 180(1), pp.309—363. 

[26] Flandoli, F. and Romito, M., 2008. Markov selections for the 3D stochastic Navier-Stokes 
equations. Probability Theory and Related Fields, 140(3), pp.407-458. 

[27] Flandoli, F. and Schmalfuss, B., 1996. Random attractors for the 3D stochastic Navier-Stokes 
equation with multiplicative white noise. Stochastics: An International Journal of Probability 
and Stochastic Processes, 59(1—2), pp.21-45. 

[28] Gie, G.M. and Kelliher, J.P., 2012. Boundary layer analysis of the Navier-Stokes equations 
with generalized Navier boundary conditions. Journal of Differential Equations, 253(6), 
pp.1862-1892. 

[29] Glatt-Holtz, N. and Ziane, M., 2009. Strong pathwise solutions of the stochastic Navier-Stokes 
system. Advances in Differential Equations, 14(5/6), pp.567—600. 

[30] Goodair, D., 2022. Stochastic Calculus in Infinite Dimensions and SPDEs. arXiv preprint 
arXiv:2203.17206. 

[31] Goodair, D., 2023. Existence and Uniqueness of Maximal Solutions to a 3D Navier-Stokes 
Equation with Stochastic Lie Transport. In Stochastic Transport in Upper Ocean Dynamics 
Annual Workshop (pp. 87-107). Springer, Cham. 

[32] Goodair et al., 2023. Existence and Uniqueness of Maximal Solutions to SPDEs with 
Applications to Viscous Fluid Equations. Stochastic Partial Differential Equations: Analysis 
and Computations. pp. 1—64. 

[33] Goodair, D. and Crisan, D., 2022. On the Navier-Stokes Equations with Stochastic Lie 
Transport. arXiv preprint arXiv:2211.01265. 

[34] Hairer, M. and Mattingly, J.C., 2006. Ergodicity of the 2D Navier-Stokes equations with 
degenerate stochastic forcing. Annals of Mathematics, pp.993-1032. 

[35] Holm, D.D., 2015. Variational principles for stochastic fluid dynamics. Proceedings of the 
Royal Society A: Mathematical, Physical and Engineering Sciences, 471(2176), p.20140963. 

[36] Kelliher, J.P., 2006. Navier-Stokes equations with Navier boundary conditions for a bounded 
domain in the plane. SIAM journal on mathematical analysis, 38(1), pp.2 10-232. 

[37] Lang, O. and Crisan, D., 2022. Well-posedness for a stochastic 2D Euler equation with 
transport noise. Stochastics and Partial Differential Equations: Analysis and Computations, 
pp.1-48. 

[38] Lions, P.L., 1996. Mathematical Topics in Fluid Mechanics: Volume 1: Incompressible 

Models (Vol. 1). Oxford University Press on Demand. 

[39] Liu, W. and Róckner, M., 2015. Stochastic partial differential equations: an introduction (p. 

266). Cham: Springer International Publishing. 

[40] Majda, A.J., Bertozzi, A.L. and Ogawa, A., 2002. Vorticity and incompressible flow. 

Cambridge texts in applied mathematics. Appl. Mech. Rev., 55(4), pp.B77-B78. 

[41] Mémin, E., 2014. Fluid flow dynamics under location uncertainty. Geophysical and Astro- 

physical Fluid Dynamics, 108(2), pp.119-146. 

[42] Mikulevicius, R. and Rozovskii, B.L., 2004. Stochastic Navier-Stokes equations for turbulent 

flows. SIAM Journal on Mathematical Analysis, 35(5), pp.1250-1310. 

[43] Mikulevicius, R. and Rozovskii, B.L., 2005. Global L2-solutions of stochastic Navier-Stokes 
equations. The Annals of Probability, 33(1), pp.137-176. 

[44] Robinson, J.C., Rodrigo, J.L. and Sadowski, W., 2016. The three-dimensional Navier-Stokes 
equations: Classical theory (Vol. 157). Cambridge university press. 

[45] Street, O.D. and Crisan, D., 2021. Semi-martingale driven variational principles. Proceedings 
of the Royal Society A, 477(2247), p.20200957. 

[46] Strichartz, R.S., 1967. Multipliers on fractional Sobolev spaces. Journal of Mathematics and 
Mechanics, 16(9), pp.1031-1060. 

[47] Temam, R., 1977. Navier-Stokes equations: Theory and numerical analysis(Book). Amster- 
dam, North-Holland Publishing Co.(Studies in Mathematics and Its Applications, 2, p.510. 


110 D. Goodair and D. Crisan 


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, 
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate 
credit to the original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s Creative 
Commons license, unless indicated otherwise in a credit line to the material. If material is not 
included in the chapter’s Creative Commons license and your intended use is not permitted by 
statutory regulation or exceeds the permitted use, you will need to obtain permission directly from 
the copyright holder. 


On the Interactions Between Mean Flows = @ 
and Inertial Gravity Waves in the WKB or | 
Approximation 


Darryl D. Holm, Ruiao Hu, and Oliver D. Street 


Abstract We derive a Wentzel-Kramers-Brillouin (WKB) closure of the gen- 
eralised Lagrangian mean (GLM) theory by using a phase-averaged Hamilton 
variational principle for the Euler-Boussinesq (EB) equations. Following Gjaja and 
Holm 1996, we consider 3D inertial gravity waves (IGWs) in the EB approximation. 
The GLM closure for WKB IGWs expresses EB wave mean flow interaction 
(WMFTI) as WKB wave motion boosted into the reference frame of the EB equations 
for the Lagrangian mean transport velocity. We provide both deterministic and 
stochastic closure models for GLM IGWs at leading order in 3D complex vector 
WKB wave asymptotics. This paper brings the Gjaja and Holm 1996 paper at 
leading order in wave amplitude asymptotics into an easily understood short form 
and proposes a stochastic generalisation of the WMFI equations for IGWs. 


1 Introduction 


Inertial gravity waves (IGWs), also known as internal waves, comprise a classical 
form of wave disturbances in fluid motions under gravity that propagate in three- 
dimensional stratified, rotating, incompressible fluid and involve nonlinear dynam- 
ics among inertia, buoyancy, pressure gradients and Coriolis forces [23, 17, 24]. 


Satellite Images and Field Data Satellite Synthetic Aperture Radar (SAR) is a 
powerful sensor for ocean remote sensing, because of its continuous capabilities 
and high spatial resolution. The spatial resolution of the state-of-the-art satellite 
SAR images reaches 20—30 m, and the swath width reaches 100—450 km. Figure 1 
shows a typical representation of the range of SAR field data and Fig.2 shows a 
typical SAR image of IGWs on the ocean surface. 
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Fig. 1 The distribution of observed IGW packets and bathymetry in the South China Sea courtesy 
of [25]. Bold lines represent crest lines of leading waves in IGW packets interpreted from SAR 
images. The rectangular box on the right of this figure outlines the IGW generation source region. 
Looking closely near the center of this figure, one sees the crescent shape of the Dongsha atoll 
whose diameter is 25 km. Details of SAR images of waves near Dongsha atoll are shown in Fig. 2 


Fig. 2 A satellite image 
showing the strong surface 
signatures of IGWs in the 
South China Sea near 
Dongsha atoll. Notice also the 
pronounced roughness of the 
surface due to surface gravity 21730N $ 
waves through which the 
IGW surface signatures 
propagate. For discussion of 
other observations, see, e.g., 
[16] 21°00'N # 


22°00'N & 


Refracted Waves 
Pe Oe 


X 


118°00'E 


117°00'E 


Theoretical Basis of the Present Work The paper [5] derived a hierarchy of 
approximate models of wave mean-flow interaction (WMFI) for IGWs by using 
asymptotic expansions and phase averages. Two different derivations of the same 
WMFI IGW equations were given. The first derivation was based on Fourier 
projections of the Euler-Boussinesq equations for a stratified rotating inviscid 
incompressible fluid. The second derivation was based on Hamilton’s principle for 
these equations. Two small dimensionless parameters were used in the asymptotic 
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expansions. One small parameter was the ratio of time scales between internal waves 
at most wavenumbers and the mesoscale mean flow of the fluid. This “adiabatic 
ratio” is small and is comparable to the corresponding ratio of space scales for the 
class of initial conditions that support internal waves. The other small parameter 
used in the asymptotic expansions was the ratio of the amplitude of the internal 
wave to its wavelength. An application of Noether’s theorem to the phase-averaged 
Hamilton’s principle showed that the resulting equations conserve the wave action, 
convect a potential vorticity and can, depending on the order of approximation, 
convect wave angular momentum. Legendre transforming from the phase-averaged 
Hamilton’s principle to the Hamiltonian formulation brought the WMFI theory into 
the Lie-Poisson framework in which formal and nonlinear stability analysis methods 
are available [15]. The Hamiltonian framework also revealed an analogy between 
the two-fluid model of the interaction of waves and mean flow with the interaction 
of the superfluid and normal fluid components of liquid He^ without vortices. 
The relations to similar results for the Charney-Drazin non-acceleration theorem, 
Whitham averaging, WKB stability theory, Craik-Leibovich theory of Langmuir 
circulations as well as the generalised Lagrangian-mean (GLM) fluid equations for 
prescribed wave displacements were also discussed in [5]. 


Goal of the Present Work Our goal here is to derive 3D IGW equations in the 
class of wave mean flow interaction (WMFI) derived in [5] as a mutual interaction 
of the mean fluid flow and the slowly varying envelope of fluctuation dynamics that 
is consistent with IGWs in the full 3D Euler-Boussinesq fluid flow. Physically, we 
take nonhydrostatic pressure effects on the wave dispersion relation into account 
and derive consistent nonlinear feedback effects of the internal of waves on the 
generation of fluid circulation based on a dynamic version of the well-known Craik- 
Leibovich theory of Langmuir circulation [3]. Mathematically, we introduce the 
two WMFI degrees of freedom by factorising the full 3D Euler-Boussinesq flow 
map into the composition of two smooth invertible maps in Hamilton's principle for 
Eulerian fluid dynamics [14]. 


The present work derives 3D equations for wave mean flow interaction (WMFT) 
as WKB wave motion boosted into the reference frame of the fluid equations for the 
Lagrangian mean transport velocity. The final equations derived here are consistent 
with traditional approaches such as Craik-Leibovih (CL) theory [3] except that the 
Stokes drift velocity in the CL formulation has its own dynamics in the present 
formulation. The present formulation can also be considered as a WKB closure 
for the GLM approach [1], similar also to the oscillation centre ponderomotive 
closure in magnetohydrodynamics [18, 19]. Namely, the present formulation uses 
a combination of asymptotic expansion and phase resonance to close the GLM 
equations derived by the composition-of-maps approach and obtaining explicit 
formulas for the wave polarisation parameters and dispersion relation for the 
Doppler-shifted frequency. 

Finally, the present work also formulates stochastic equations of motion for 3D 
WMFI dynamics, permitting a statistical representation of the uncertainty present in 
observational data of geophysical flows. 
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2 Deterministic 3D Euler-Boussinesq (EB) Internal Gravity 
Waves 


2.1 Lagrangian Formulation of the WMFI Equations at 
Leading Order 


GLM Theory The Generalised Lagrangian Mean (GLM) theory of wave mean 
flow interaction (WMFT) is derived in Andrews and McIntyre [1] by taking the time 
mean (-) at a fixed position x of the Eulerian fluid velocity, U(x, t), shifted to a 
rapidly fluctuating position, xë :— x + œ (x, t) with constant scale factor œ < 1 
and zero Eulerian mean &(x,t) = 0. The Lagrangian mean velocity ua (x, t) at 
Eulerian position x is then defined via the following calculation, 


UG , t) := U(x c a&(x, t), t) = u (x, t) + a (x t) 
where U(x + a£(x, t)) =: ul (x,t), (2.1) 
with 
d L e zL L 
ee) = 88 t (u .V)&£ =: u", uf=0 and ul =u“. (2.2) 
Consequently, the Kelvin circulation integral for GLM in a rotating frame with 


constant Coriolis parameter 282 may be derived; see, e.g., [1, 5, 6, 7, 9, 10] and 
the appendix for details, 


Igi U”) = f (u(xé, f) + Q x x5) - dx? 


c(uL) 


= f (u^ (x, t) + Q x x) - dx (2.3) 
c(uL) 


+ o? (Q x E(x, t) + u (x, 1) - d£(x, t). 


The Lagrangian transport velocity for GLM in (2.3) is indeed u^. However, the 
Eulerian momentum per unit mass in the integrand of the GLM circulation integral 
in (2.3) acquires an order O (a?) shift due to the mean effects of the quadratic 
nonlinearity in the last fluctuating displacement terms in (2.3). 


Choice of GLM Closure Gjaja and Holm [5] studied the dynamics of 3D IGWs 
by closing the GLM theory for the case that the fluctuation displacement o£ (x, t) 
in (2.1) is given by a single-frequency travelling wave 9t (a(ex, et)e!? €X-€)/€) with 
slowly varying complex vector amplitude a(ex, t) and slowly varying, but rapid 
phase $ (ex, et)/e, with e « 1; so that the time averaged Lagrangian mean of the 
displacement field w& (x, t) would be negligible. 
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We choose to represent the fluctuation displacement field & (x, t) in the following 
form 


E(x, t) = alex, ere! XDE + a* (ex, etje SEDE (2.4) 


and the total pressure decomposes into 


p(X, t) = po(X, t)» Lo! (o; (eX, ered € X enje i b*(eX, eije IRn) , 
jzl 
(2.5) 


Here the adiabatic parameter €e is defined as the ratio between space and time 
scales of the wave oscillations and mean flow respectively. Thus, quantities that 
are functions of x and t, for example (x,t), have fast dependence on x and 
t. Likewise, quantities which are functions of ex and et, for example a(«ex, et), 
have slow dependence of the space and time coordinates. Thus, in the fluctuation 
displacement £ in (2.4), the fast phase dynamics is represented by exp i$ (ex, et)/e 
which is slowly modulated by the complex vector amplitude a(ex, et). 

We will apply the GLM closure in Eqs. (2.4) and (2.5) to the 3D Euler- 
Boussinesq equations, which can be derived from Hamilton's principle with the 
following reduced Lagrangian 


ti 1 
o=3 | f 2 (si? v: a x- sez) + pa - exar. (2.6) 
to M 2 


where Zd?X = d?xg € Den(R?) is the fluid density, o € F(R?) is the fluid 
buoyancy, and M is the spatial domain. Substitution of (2.1), (2.4) and (2.5) into the 
Euler-Boussinesq Lagrangian in (2.6) followed by asymptotic expansion ina < 1 
and e < 1 at order O(a”) neglecting corrections at orders O(ae) and O (a^) and 
phase averaging (i.e., keeping coefficients of resonant phase factors only) produces 
the following wave mean flow interaction (WMFT) closure for Hamilton's principle 
in Eulerian fluid variables, which splits into the sum of the average mean-flow action 
Lyr and the average wave action Lw, given by [5] and derived in the appendix as, 
cf. Eq. (A.8), 


ti _ _ 
0 = Gur + Sw)=6 | (Lur +a°Lw) dt 
to 
2 E: 222,2 I~ 
zu f p| lu l^ +u” -Q xx- pgz +a © |a| + 2ia@Q . (a x a*) 
to M 


9? po 
Xj OX; 


—a7i (bk - a* —b*k - a) o^ajaj- Ja D) po + Ole) d?x dt. 


(2.7) 
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The averaged fluid quantities ua (ex, et), D(ex, et) and p (ex, et) are defined to 
have slow dependence on x and t in the averaging procedure. To see the construction 
of slow dependence from Lagrangian labels, see section (2.1) of Gjaja and Holm 
[5]. In the averaged wave Lagrangian Ly, the wave vector and wave frequency are 
defined in terms of the wave phase ¢ (ex, et), as 


ð 
k(ex, et) :— Vegh (€x, €t) and w(ex,e€t):= E €t). (2.8) 
€ 
The Doppler-shifted oscillation frequency, ©, due to the coupling to the mean flow 
u is defined through the advective time derivative ES = 3 + u} . Vex and the 
wave phase as 


à à $ à $ + ul - Vexo uł -k (2.9) 
w := = . —0— è E * 
det det PS 


Upon introducing the Doppler-shifted oscillation & into Lw in (2.7) and pairing 
its definition in (2.9) with a Lagrange multiplier, N, one arrives at the following 
variational principle 


ti 


0 = 8(Sur + Sw) = af (Lur +o?Ly) dt 


to 
ti 1 
- Ji I DET tu... Q9 x x— pgz +a 9 al” + 2io? ðQ. (a x a“) 
to M 


9? po 
0xjOx;j 


— o? i (bk. a* — b*k- a) — o2 afa; ] a Dros 


a m 
+a? (N, m a e Vds 2E O(a?e) dt. 
€ 
(2.10) 


Since, it may not be immediately clear how to take variations of the action (2.7). 
The inclusion of the Lagrange multiplier, N, imposes the relationship among the 
Doppler-shifted frequency @, the Lagrangian mean velocity u^, and the phase ¢, 
thereby facilitating the variations. Namely, the forms of the constrained variations 
of the velocity field u^ and its advected quantities, D and p, are shown in (2.14). 
All other variables have arbitrary variations. The Euler-Poincaré theorem can then 
be applied to the variational derivatives with respect to u^, D, and p, to obtain 
an equation for the total momentum of the system, and stationarity of the action 
with respect to the remaining variables implies a collection of equations for the 
remaining dynamics. This procedure results in a closed system of equations for 
both waves and mean flow, and describes their mutual interaction. For Hamilton's 
principle of least action to apply to an asymptotically expanded action, we make use 
of the following definition to formalise the idea of Hamilton's principle to a given 
order, in the situation where the action is expanded asymptotically. 
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Definition 2.1 (Variational Derivatives in an Asymptotically Expanded Action) 
When making an asymptotic expansion in Hamilton's principle, the Lagrangian 
in terms of any new variables, eu, D, p) for example, becomes an infinite sum. 
Variational derivatives are then defined under the integral up to some order, i.e. 


07 asa f tu’, D. pd 
ôly2 L ôl2 

= ô x 
[Gs dd EE: 


where the truncated Lagrangian, £,2, is defined as the part of the Lagrangian which 
corresponds to these variations 


(2.11) 


Ly, 
l sD) + » io) +000, 
p 


£(u, D, p) = £,2(u, D, p) + H.O.T.. 
Note that we have declined to use the *big O' notation in the above equation, since 
£,2 is defined to include all terms of order less than a?e as well as any higher order 


terms which produce lower order terms after integrating by parts to take variational 
derivatives. 


Hamilton's action principle (2.10) yields the following variations up to order 


O(a?) 


0= sf (Lur " e?Ly) dt 
-[ (ow, Du. 4-D9 x x— o? N Ve] + (80, —Dgz) + (6b, -oik - a) 
+ (ab, oik - a) 
+ (ja. a ? (Dea + 2i@a* x Q + ib*k — (a*- V)V po)] 
+ (ja* a 2 (ng? a+2iða x € — ibk — (a: V)Vpo)) 
+ (335, 2o? D(Slal? + iQ- (a x a*)) — a? N)+ (8D, w) 


slow, — Leet nee 


+ 


o. c — N + divex(u’ N) + idivex(Dba* — Dia) d 
( 


+ (po, 1— D) + O(a’). 
(2.12) 
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where we have 
gis ô(LuF + a?’ Lw)/8D 


1 
= lul — pgz t u^ Q x x— po 


Ns = 9? 
+a? (sia? enis. x a*) — i (bk - a* — b*k - a) — ata; ) , 
` OX; OX j 


(2.13) 
The constrained variations in (2.12) take the Euler-Poincaré form [14] 
L.vxv—v-. Vexu., 59=—-v-Vexp, ôD =-—div.x(vD), 
(2.14) 


ð 
ôu“ = — v+u 
det 


where the € appears in the derivatives of the constrained variations due to their slow 
dependence on space and time. Note that when isolating the arbitrary variations, 
v, through integration by parts, Vex does not generate higher order terms when 
operating on c. From the constrained variations, one has that p and D are advected 
by the flow which then satisfies the following advection equations 


a a 
— D -divx(u D) 20, —p +u}. Vexp =0. (2.15) 
det det 


The total momentum of the mean and fluctuating parts of the flow is defined through 
the variational derivative w.r.t to u^, which is given by 


M := Du + DQ x x—a’ NV. xd, (2.16) 


which through the Euler-Poincaré theorem [14], satisfies the Euler-Poincaré equa- 
tion 


y (M ans Mao latas i Pus 
de DJ T € un P RO Kar 


N 8? po 
2 —2 2 * = 
+a Vex (-o5 + wal“ + aja; Tais =0, 


(2.17) 


where Z := Vxz. Stationarity of the sum of actions Sy pr + a? Sw in (2.10) under 
variations with respect to the fluid variables (u^, D, p) has produced the equations 
for the mean flow, with order O (o?) wave forcing which arises from the dependence 
of a*Lw on the fluid variables D and p. We note from the variation in po that 
incompressibility of the Lagrangian mean velocity holds only within the asymptotic 
regime, and does not hold in an exact form. Indeed, 
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32 
D=1 -e — — (Dažaj)=1 +0) = divx(u^) = O(@’e). 
ðexiðexj 1 
(2.18) 


Conservation of Wave Action Density Keeping only resonant combinations in the 
Lagrangian Lw in (2.29) has eliminated any explicit dependence on the phase, ¢. 
Hence, a symmetry of the Lagrangian under constant phase shift, 6 — $+ ġo, has 
arisen. Consequently, one expects that Noether's theorem will yield a conservation 
law for the conjugate momentum N under variations in $ of the average wave 
Lagrangian, Ly. The arbitrary variation 5@ in (2.12) reveals the definition of N 
as 


ôL - 
N := d = 2D(lal? 4- i$ - a x a*), (2.19) 
w 
and the arbitrary variation ô in (2.12) produces the following wave action 
conservation law, 


ON ai (N(u* +ve))=0, wh iD ath — ab*) — 2D s (ab*) 
== iv u^ +vg))=0, where vg := = — 9 , 
det ™ C SUUM N 

(2.20) 
Thus, the wave action N is transported in an Eulerian frame by the sum of the 
Lagrangian mean velocity u^ and the group velocity of the waves, vg, defined above 
in (2.20). The evolution equation of ¢ in (2.9) can be written in terms of N as follows 


D $a. v $= — ( 
det e 2DIal2 


N —2DiQ-ax a*) , (2.21) 
thus removing the explicit dependence on ©. The Eqs. (2.20) and (2.21) are in fact 
canonical Hamilton’s equations boosted to the reference frame of the mean flow u^ 
which is discussed in Sect. 2.2. 


Remark 2.1 (Boundary Conditions for Integrations by Parts) In taking varia- 
tions of wave properties, we are not considering a free upper boundary. Instead, we 
have set 

~ * p - ~ L _ 

(Qi .óa*)a. = =0 and $89 fi- N(u^ + va) =0, (2.22) 

x 

on the boundary, when integrating by parts. This means that the displacement of the 
wave amplitude and the flux of wave action density are both taken to be tangential 
to the boundary. 


Combining the evolution equation of wave action density N (2.20) and wave 
phase $ (2.9), one has the evolution equation of the internal wave momentum 
p/D := a? NVexo/D. 
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2 
—— =u” x (Vex x 2) + Vex (u: . P) = E (NV © +kdivex(NvG)) . 
(2.23) 


The Euler-Poincaré equations for the total momentum (2.17) and wave momentum 
(2.23) may be assembled into the Euler-Poincaré equation for the mean flow 
momentum, m = Du! + D@ x x. Dividing this through by the advected mass 
density, D, gives the following equation for u^ 


ð 1 1 4 
—u- ux curlex (v +Qx x) + Vex —jul |? + po | —gpoz 
det 2 € 
N 8? po 
= 2 ~ ~2) 12 oy 
= —aVex (-95 + lal” + aja; Suan) (2.24) 


2 
= S (N Vex © + kdivex(Nvo)) . 


Remark 2.2 (Hydrostatic and Geostrophic Balances) As explained in section 
2 of Gjaja and Holm [5], at leading order O(1/e) the motion equation (2.24) 
establishes hydrostatic and geostrophic balances, namely 


Opo(x, €t) — 


, 2.2 
3x 0 (2.25) 


22 x ul (ex, et) + gp(x, et)Z + 
In order to provide the restoring force for internal waves, the advected relative 
density (or, buoyancy) p(l (x, t)) must have one derivative of order O(1) with 
respect to the vertical coordinate z. In order to contribute to the wave component 
of the pressure gradient at order O(a”) in the motion equation (2.24), the mean 
pressure po must have two derivatives of order O(1) with respect to the vertical 
coordinate z. 


Remark 2.3 (Kelvin’s Circulation Theorem for WMFI) The two Euler-Poincaré 
equations for the total momentum M and mean flow momentum m readily implies 
their respective Kelvin-circulation theorems. Namely, for the mean flow momentum 
m, (2.24) implies the following 


d 


1 
— (i c 3) due d —pgz-dx 
det c(uL) € 


L 
oe (2.26) 


= -2 $ D^ (Nv.xG 4 kdivex(Nva)) : dx, 
c(uL) 


in which one notes that the internal wave terms contribute to the creation of 
circulation of the mean flow at order O(a”). For the total momentum M, Eq. (2.17) 
implies that 
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d 


1 
2 (u +Q xx- a?D-INk) dx d -pgz-dx=0. (227) 
det c(UL) 


c(u^) € 


Thus, just as for the introduction of Stokes drift velocity into the integrand of 
Kelvin's circulation theorem in Craik-Leibovich theory [3], one may regard the 
additional non-inertial force of the internal waves on the mean flow circulation 
as arising from a shift in the momentum per unit mass in the Kelvin circulation 
integrand, performed to include the internal wave degree of freedom. 


Legendre Transforming Wave Lagrangian Lw Into Canonical Phase Space 
Variables By using the definitions of N and @, one can compute the Legendre 
transform of Lw to obtain the following WMFI Hamiltonian Hw, 


Aw := (N, &) iw = f D (@lal? + i (bk - a* — b*k - a) 
M 


32 
+ařaj e) d?x 
ioxj 


i (2.28) 
=f Diaz (N — 2iD@-ax a*)? +iD (bk - a* — b*k- a) 
M 


9? po d? 


T Dana OXj OX; 


x , 


where we have dropped the dependence on higher order terms O (oe, a^) in the 
asymptotic expansion. Inserting the expression (2.28) for Hw into (2.10) yields the 
phase space expression of Lw as 


" 3p ıı 1 ; 2 
Lw=] -N(— V —— 3À(N -2iDQ. ij 
" 3 Can ab) Da oe 
: (2.29) 
s: * * * ð PO 2 3 
— iD (ba* — b*a) - Vp — Dajaj + O(o*e) d^x. 
'OxjOxj 


Remark 2.4 (Physical Interpretation of GLM WMPFI) The variations of the 
WKB mean wave Lagrangian Lw in the variables M and $ recover canonical 
Hamiltonian WKB wave equations (2.20) and (2.21) for N and $. These canonical 
equations have been boosted into the reference frame of the Lagrangian mean 
transport velocity u^. Moreover, the Lagrangian mean transport velocity u^ satisfies 
the Euler-Boussinesq equations on the left-hand side of Eq. (2.26). Thus, the phase 
space expression of the wave Lagrangian Lw provides the physical interpretation 
of the WKB mean wave motion in GLM. Namely, GLM expresses WMFI as WKB 
wave motion boosted into the reference frame of the Euler-Boussinesq equations 
satisfied by the Lagrangian mean transport velocity, u^, and its corresponding 
pressure, po, and density, o. The dependence of the wave Lagrangian o?Lw on 
the fluid variables D and p implies from its variation in po that incompressibility 
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of the Lagrangian mean transport velocity, u^, no longer holds exactly. Indeed, 
Eq. (2.18) shows that the divergence of ul is of order O(a@7€), which would need to 
be considered when going beyond the order of asymptotics O(a) considered here. 


Remark 2.5 (Potential Vorticity (PV) Advection Theorem for WMFI) Rewrit- 
ing the indicated operations in the Kelvin circulation theorem for WMFI after 
applying the Stokes thereom gives us 


1 
(8; + ££, )d(D^ M. dx) + -gdp ^ dz =0, (2.30) 
€ 


where £* denotes the Lie-derivative taken w.r.t to the rescaled basis ex. Since D 
and p are advected, i.e. they satisfies the advection equations (2.15), one finds 


(3+£6,) (a(D7' M- ax) ^ dp) 
- £O (D^ Vex -curlx (D^ M) Dà?x) — 0. (2.31) 


Consequently, one finds the following total advective conservation law for WMFI 
potential vorticity PV, 


ð 
agi th - Vexq =0 where g:=D7!Vexp - curlex (u^ +Qxx+ a? D~' Nk) . 
€ 
(2.32) 


Solving for Wave Polarisation Parameters/Lagrange Multipliers p, b, b*, a and 
a* The quantities p and b in (2.10) are Lagrange multipliers which impose the 
incompressibility constraints for volume preservation D = | and transversality of 
the wave vectors k - a = 0, respectively. The complex vector wave amplitudes a 
and a* are also Lagrange multipliers whose variations impose a linear relationship 
among most of the wave variables. In particular, stationarity of wave action Sw 
under variations of wave polarisation parameters b and a* gives, respectively, 


j 
k-a=0 and Ba — 2M x a— (a- VS = idk, (2.33) 
x 


from which b follows easily from the first constraint, upon taking the dot product of 
k with the second constraint, 


|k|?ib = —2i0(@ x a) -k —k- (a- V)V po = =k! (2168; + (po) Ja : 
(2.34) 


where Qj — —éijk € and the complex vector amplitude a is found from the 3 x 3 
Hermitian matrix inversion, 
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ge (poi | i329» — (po)12 12213 — (po)13 aj ky 
1@22i2 — (po)12 a? — (po)Do2  1@2223 — (po)23 | Pi | a2 | =ib| ka |, 
12013 — (po)13 ©2223 — (po)23 © — (po)33 a3 k3 
(2.35) 


in which the 3 x 3 matrix P, given by 


kikj 
Pj = (5; = x2) 


projects out the component along k of the complex vector amplitude a € C?. 


An Index Operator Form of the Polarisation Constraints The wave polarisation 
constraints in (2.33) and (2.35) may be rewritten in index form as 


9? po 


0xJ ax! 


so a" Djja! — 0. (2.36) 


alk; =0 and Dija = ibk; with Dij = Dij + io2Q; — 


The index operator form in (2.36) of the polarisation relations for (a, b) in (2.33) 
suggests a more compact representation of the wave Lagrangian, Lw, as we discuss 
next. 


Representing the Wave Polarisation Parameters a and b as a Complex Four- 
Vector Field After an integration by parts using the boundary conditions in (2.22), 
the Eulerian action principle in (2.10) may be expressed equivalently as 


ti - 
0 &Gur £a? Sy) = f (Lyr +o’Ly) dt 
to 


11 D 
= JI T (2i? + Du’ .Q x x — gDpz — p(D — 1) (2.37) 
to M 2 
+a’ DF”* D, F” + O(c?e, «) dxdt, 


where, for notational convenience, the fields a and b comprise a complex “four- 
vector field”, 


F" = (a, b)", 
with u = 1, 2, 3, 4, and the Hermitian dispersion tensor D,,, = D is given by 


9? po 


Dij = o^ + i@2Q;; = FEEMI 


Q Daj =ikj =—Dj4, Da4-0. 
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It is clear from the decomposition of the WMFI action in (2.37) that stationarity of 
Sw with respect to variations of the fields F = (a, b)? yields linear relations among 
the wave parameters (a, b) that recover the polarisation relations in (2.33) 


DF” =0. (2.38) 


Equation (2.38) recovers the linear constraints in (2.33) on the polarization eigendi- 
rections of the field F” up to an overall complex constant that can be set at the initial 
time. 


Doppler-Shifted Dispersion Relation The solvability condition det(D,,,) = 0 for 
(2.38) now produces the dispersion relation for the Doppler-shifted frequency of 
internal gravitational waves (IGW), 


à (o — u^ ky? = (— 2 gu! Va 


det 
(22 - k)? kikj\ 9 po 
= ái ) Fu 2.39 
Ik|? (s; Ik|2 7 dx! 8xJ Se 


which is independent of the magnitude |k| of the wave vector k, except for the 
Doppler shift due to the fluid motion. Formula (2.39) updates the phase $ of the 
wave at each time step. The complex vector amplitude a is then found from inversion 
of the 3 x 3 Hermitian matrix in (2.35).The remaining wave quantity b is then 
determined from (2.34) at a given time step. 


Remark 2.6 Under conditions of hydrostatic balance and equilibrium stratification, 
when u^ — 0 and the pressure Hessian pij has only the p33 component, Eq. (2.39) 
reduces to the well-known dispersion relation for linear internal waves [23]. For 
non-equilibrium flows, though, Eq. (2.39) shows the sensitivity of the propagation 
of internal waves to the pressure Hessian. 


2.2 Hamiltonian Structure for the WMFI Equations at 
Leading Order 


Thus far, we have considered a Legendre transform within the wave Lagrangian (see 
Eq. (2.28)). It remains to perform the same calculation for the mean flow to see the 
full Hamiltonian structure of the model. We define the momentum of the entire flow 
by 


M := Du + DQ x x—a’NVexo —:m-p 
with m := Du’ + DQ xx, and p:—o?NV.xó. (2.40) 
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In the above definition, the momenta m and p are the mean and wave parts of the 
momentum respectively and the total momentum, M, is the variational derivative of 
the constrained Lagrangian (2.10) with respect to the Lagrangian mean velocity. We 
perform both the wave and mean flow Legendre transforms concurrently as 


h=(M, u^) +0? (N, œ) — Lup -Lw 


(2.41) 

= (Du! + D x x, u^] + o? (N, ©) Lupa Ly. 

The resulting WMFI Hamiltonian then follows as 
1 2 
h(M, D, p, p, N) =i xp M+P- D(Q x x)| 
2 2 
D(N 

+ Dpgz + Tum (5 — 2iQ - (a x 3) 

lal (2.42) 


+ (bpa —*p-a) 


92 
4 a? Da*a; n tO - Dpot x. 
i OX; OX; 


The variational derivatives of the constrained Hamiltonian (2.42) may be determined 
from the coefficients in the following expression, 


n= f | - mop Decio ew im - (1— D)ópo 
iD 
tof [s - ^ (bk -a* - b*k-a) [SN 
+ [u^ + vc] . 6p + ia2D(sbk - a* — 8b* k - a) (2.43) 


— o? 5a" (Da + 2i D&(@ x a) 


f 9 \ ðpo 244 
iDbk D(a. x) ree | + O(ae) dx, 


Remark 2.7 (Discussion) 


e The quantity w = —óh/8 D = 8(Lur + o? Lw)/8D is the Bernoulli function 
given by Eq. (2.13) now expressed as a variational partial derivative holding fixed 
the other variables in A arising in the Legendre transform. 

* The Hamiltonian A in (2.42) is stationary for variations in the diagnostic variables 
(a, a*, b, b*, po). The stationary variations in these variables written in (2.43) 
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generate the constraints on the prognostic variables and the relations among 
the diagnostic variables. The solvability condition for these relations among the 
diagnostic variables determines the dispersion relation for the WKB IGWs. 

* The N and p equations can combine to yield 


Oe; Kk + Vexo = 0. 


This is the so-called ‘conservation of waves’ equation, which imposes equality 
of cross derivatives of the phase function, $ (ex, ef). 


The above variational derivatives can be assembled into the following untan- 
gled Lie-Poisson Hamiltonian form which separates the dynamics of the total 
momentum M in (2.40) and the advected fluid variables, D and p, from the wave 
momentum p and wave action density N, 


Mj Mxk0¢; + 9e Mj Doe; — Pej 0 0 
3 D Og D 0 0 0 0 
spe |57 Pek 0 0 0 0 
Pj 0 0 O proc) + ex pj Ne; 
N 0 0 0 Oe IN 0 
8h/8My = ut * 
óh/8 D = —w 
x óh/óp = D gz : (2.44) 
ôh/ôpr = (u^ + vc)" 
8h/8N = o? à 
Here, we are using a shorthand notation for the derivatives, dej = 9/ ðexj for 


example, and we have used the constraint that k - vg = 0 in taking the variations in 
b and b*. 


Remark 2.8 The untangled Lie-Poisson Hamiltonian form in (2.44) of the ideal 
wave mean flow system of equations derived in the previous section from the GLM 
Hamilton's principle represents a constrained Lie-Poisson Hamiltonian fluid system. 
Its Lie-Poisson bracket is defined on the dual of the direct sum of two semidirect- 
product Lie algebras 


Xror®@(Fur © Denyr) © (Kw. Fw). 


Dual variables in L? (IR?) pairing are the following, whose definitions also explain 
the geometric meanings of the standard calculus notation for the (MF) and (W) 
variables. 


* The total momentum 1-form density M 2M-dx& d?x e A! (IR?) & Den(R?)) 
is dual to the vector fields Xror (R5). 
* The density D = Dd?x € Deny r (R?) is dual to scalar functions Fy r (R°). 
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* The scalar function p € Fy p (IR?) is dual to densities Deny p (R?). 

* The wave momentum 1-form density P = p: dx & d?x € A! (IR?) & Den(R?)) 
is dual to the vector fields X w (IR?). 

* The wave action density N = Nd?x is dual to scalar functions Fw (R3). 


Remark 2.9 (Preservation of PV Casimirs) Notice that the Casimir functions for 
the Hamiltonian structure of GLM WMFI in the upper left block diagonal of the 
Lie-Poisson operator in Eq. (2.44) are in the same form as for the Euler-Boussinesq 
fluid, except they have been modified to accommodate the wave momentum. 
Consequently, no Casimir functions have been gained or lost in coupling the mean 
flow to the fluctuations. 


Canonical Structure of the Wave Dynamics 

The wave dynamics above are written in their Lie-Poisson Hamiltonian structure. 
Should we return to the canonical variables, N and $4, then the standard canonical 
structure emerges. Indeed, substituting p = œ? NV¢ into the Hamiltonian (2.42) 
and taking variations gives 


a 8h " 

ud e c aul - Vexó — 070, (2.45) 
ƏN 6h 

ae xa —a* divex (Nu.) — a?i divex (D(ba* — b*a)) . (2.46) 


Tangled Version of the Lie-Poisson Hamiltonian Structure By writing the 
Hamiltonian in terms of the mean flow momentum, m, rather than the total 
momentum, M, we recover the tangled version of the Lie-Poisson Hamiltonian form 
of the equations. Above, as in [13], we have presented wave-current interaction in 
its untangled form. In a previous work [12], the authors presented both the tangled 
and untangled forms, and an analogous calculation is also possible for this model of 
WMFI. Indeed, the Hamiltonian ^ (M, D, p, p, N) in (2.42) becomes 


1 
h'(m, D, p, p, N) = I IPLE DQ x x)|? 


2p (N : 
+ Dose + sas (p22 a) | 
x (2.47) 
iD 
—(bp-a*—b'*p.- 
+ na p.a p-a) 


9? po 


2 * 
Daa: 
TA R dj OxjOx 


+(D— Dp] ds. 
j 


! The constant factor of œ? appearing within the canonical structure has emerged due to the choice 
of multiplying the constraints in Hamilton’s principle by the same constant. 


128 D. D. Holm et al. 


The variational derivatives are largely the same, with differences only in the 
variation with respect to p. The tangled form of the Hamiltonian equations in the 
Hamiltonian A (m, D, p, p, N) in (2.47) is 


mj Me; + Qem j Doe; — Pej PRO; + dek Pj N06¢j 
3 D Oe D 0 0 0 0 
Jet p = Pyek 0 0 0 0 
Pj Pk9ej + Vek Pj 0 0 Pkõej + Vek Pj N9,j 
N Oe N 0 0 dek N 0 
dh’ /ómy = ul* 
6h'/5D = —w 
x | bh’/60 = Dez |. (2.48) 
8h! /Spe = V$ 
bh'/6N = 070 


Instead of the direct sum in the untangled case in Remark 2.8, this tangled Lie- 
Poisson bracket is defined on the dual of two nested semidirect-product Lie algebras 


(Xur©@(Fur 6 Denyr)) © (XwOFw). 


Corresponding dual variables in L^(R?) pairing are similar to those explained in 
Remark 2.8. 


3 Stochastic WMFI 


Stochastic equations of motion may be used in fluid dynamics to model uncertainty, 
and such equations may be derived through Hamilton's principle [8]. Such stochas- 
tic terms can be used to parametrise unresolved ‘subgridscale’ dynamics absent 
in computational simulations, and as such are particularly relevant to geophysical 
applications. 

Motivated by the fact that, due to computational limitations, the mean flow may 
only be solved for on a coarse grid when considering large scale geophysical flows, 
we apply the method of stochastic advection by Lie transport [8] to the mean flow 
map, g;. This may be done as 


dgixo = (u^ o gj)xo dt + Y (6; o &)xo o dW}, (3.1) 


i 
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where W/ are independent and identically distributed Brownian motions and od W; 
denotes Stratonovich integration.” This is equivalent to 


dgg (x) = u (xj) dt + 2 t; (X) o dW] =: dx, 


L 


and we see that the Lagrangian mean velocity, u^, has been stocastically perturbed. 
By an application of the Kunita-It6-Wentzell formula [4], we see that 


dX, = dgrg, ! X, = dx, + a? (d£, (x) + dx, - V£,(xj)) . (3.2) 


Should we assume that the entire motion, corresponding to U; = 22^, also has a 
stochastic part, corresponding to EB. then we have 


U, dt + M3 t$ odW} = dx, + o? (d£, (x1) + dx, - VE, (x). (3.3) 


L 


The uniqueness of the Doob-Meyer decomposition then indicates that each p 
decomposes into a part corresponding to the mean flow, ¢;, and a part corresponding 
to the wave motion, which we call o ;. . [21]. 


Remark 3.1 Following Street and Crisan, [21], by the compatibility of €, with the 
driving semimartingale, we have a representation dé = Ao dt + 9^; Ai o dW!. The 
uniqueness of the Doob-Meyer decomposition then gives U; = u^ + a? (Ao Tub. 


Vex£;) and o; = Cic o? (Ai téj- Vex£;). 


WMEFI is not limited to temporally averaged terms. The variability of WMFI must 
also be considered. This consideration results inevitably in differential equations for 
the slow components of the climate system, which include stochastic transport and 
forcing terms. There are many ways of introducing stochasticity into the WMFI 
system. Some guidance in this matter can be found, e.g., in [10]. 

In this section, we will consider two distinct framework of introducing stochas- 
ticity into Hamiltonian fluid systems. The first option laid out here in this section 
enables wave and fluid dynamics to possess different stochastically fluctuating com- 
ponents in their transport and phase velocities, as follows, in which variations of the 
deterministic Hamiltonian below are the as those in Eq. (2.43). The introduction of 
the stochastic vector fields to the WMF evolution equations can be accomplished by 
making the deterministic Hamiltonian to the WMFI a semimartingale in each degree 
of freedom. The chosen augmentation of the Hamiltonian is based on coupling noise 
by L? pairings of spatially varying noise *modes' with the momentum maps dual to 
the respective velocities for each degree of freedom, m, p, and N. The coupling is 
done such that the variational derivatives with respect to the momentum variables 


? The notation o may be used to denote both composition and Stratonovich integration. 
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will add stochastic transport terms to each of the corresponding dual velocities, as 
follows, 


dh f hn A A E Í 
= — |m — xx —— | — — 212. (ax 
2D PET Hale (D 


iD 2 po | 3 
+ (bp a-b" pea) +a ae a d°x det 


+ J (D — 1)dpo d?x 


+E [mi toooawi dx +Y [ p- oio aB Ps. 
(3.4) 


where dW; and d B] are chosen to be distinct Brownian motions and ¢;(x) and 
0; (x) in principle need to be determined from calibration of transport data of each 
type, and leading eventually to uncertainty quantification. We have introduced a 
stochastic component of the pressure, thus introducing the notation dpo, following 
the framework of semimartingale driven variational principles [21]. The influence 
of the stochastic terms on the Lie-Poisson Hamiltonian dynamics can then be easily 
revealed, as 


mj my0cj + Qeym ; Dj — Pej Pkõej + Qex pj N8ej 
D dex D 0 0 0 0 
a| p |=- Pek 0 0 0 0 
Pj PkOej + OekPj O — O0 — px8ej + Qexp; N8ej 
N OQ, N 0 0 dek N 0 


5dh/dmy = u'* det + cf (x) o aWi, 
ôdh/8D = 7 det + dpo 
x ôdh/ôp = Dgz det , (3.5) 
dh/8py = vi, det + of (x) o d Bi, 
ddh/5N = a? @det 


where 7r is given by 
wT —-—U — po, 


for w as defined in Eq. (2.13). The Hamiltonian variables are as defined in the 
deterministic case, 


E L ilu iD * * _ 2d, * 
m:= Diu’ +Q xx), p:=a°Nk, yc i= a en cS 
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These variables have already appeared in the integrand of Kelvin’s circulation 
theorem in (2.27). The stochastic version of the GLM Kelvin circulation theorem 
for Euler-Boussinesq incompressible flow in Eq. (2.27) thus becomes 


ag D^ M-dx- dd (+g xx- D^lp) . dx 
c(dx;) c(dx;) 


1 
=-is$ p dz det , (3.6) 
€ c(dx;) 


in which the material loop c(dx;) moves along stochastic Lagrangian trajectories 
given by the characteristics of the following stochastic vector field 


N 
dx, = u (x, dt +Y t, (x) oaW? . (3.7) 
a=1 
A Stochastic Canonical Structure in the Wave Dynamics 
The canonical structure between the wave variables N and $, noted in equations 
(2.45) and (2.46), now becomes stochastic. Indeed, substituting M and p — 
a? NVex@ into the action and taking variations gives 


dh ; " 
a?d$ = == —o7ul . Vexo det — a? » ti- Vexó odWl, — a? det 
—a? Y Vaxó oi o d Bis, (3.8) 
i 
dh 


a?dN = 367 —o? divex (NuL) det — o? 2. divex(NE;) o dWi, 


t 


— oi divex (D(ba* — b*a)) det —œ° V divex (No;) o dBi, . 
i 
(3.9) 


Such a stochastic generalisation of Hamilton’s canonical equations has been noted 
and discussed for wave hydrodynamics previously [20] for the classical water wave 
system. 


An Alternative, Energy-Conserving Approach to the Incorporation of Stochas- 
tic Noise 

The second option of introducing stochasticity into the WMFI system is through the 
modification of mean flow and wave momentum to include different stochastically 
fluctuating components. The introduction of the stochastic momentum can be 
accomplished by making the deterministic Lie-Poisson bracket of the WMFI system 
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to include stochastic components. Following [11], the chosen modification is the 
addition of “frozen” Lie-Poisson bracket multiplying semi-martingales. The fixed 
(frozen) parameters in the frozen Lie-Poisson brackets are the spatially, possibly 
temporal varying noise “modes” which are transformed by the deterministic trans- 
port and phase velocities in the same way as the deterministic momentum. Let i; 
and y; denote the stochastic fluctuations of the mean flow and wave momentum 
respectively, the stochastic Lie-Poisson equation can be written as 


mj Mk Ve; F Qem j Dàj — Pej Pkõej + Qex pj Ne; 
D dex D 0 0 0 0 
d| p |=- Pek 0 0 0 0 
Pj DkOej Op; O O  piOej + Ok pj Ne; 
N Oy N 0 0 Qe N 0 
u.* det 
—7 det + dpo 
x D gz det 
ve det 
ao det 


(aide + derai) od Wi, 00 (wiae + daw!) od Bi, 0 
0 00 0 0 
-> 0 00 0 0 
i (vic ES daw!) o dBi, 00 (vic; n daw!) o dBi, 0 
0 00 0 0 

uL K 

—7t det + dpo 
x Dgz 


(3.10) 


Here, the stochastic component of the pressure dp is added as before following 
the semimartingale driven variational principle [21]. Similarly to the stochastic 
vector fields £/ and o, we need to determine A’ and y! through calibration with 
existing data for each type of momentum. The influence of the stochasticicty on 
the circulation dynamics of the mean flow and wave momentum is clear from the 
following modified Kelvin-circulation theorem 
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L lu 
d (u^ + € x x) dx + - pg 
c(uL) c(uL) € 


+o2D7! (NV ex + k divex(Nve)) .dxdet 


3 , ; 
D (wu x — x X — Vex (ul a - dx o dWi 
xu b» Fas (v x em x eX (u XodW;, 


ð , 
—1 i M . ae 
+) Pau? (ve X za x W Vax (ve " )) dxo dBi, =0, 
(3.11) 


where the loop is moving with the deterministic velocity field. By construction, 
the Eq. (3.10) preserves the deterministic energy path-wise as the Poisson struc- 
ture remain anti-symmetric and the variational derivative of the Hamiltonian is 
unchanged. However, the modification of the Poisson structure implies that the 
standard EB fluid Casimirs are no longer conserved. 


4 Conclusion 


In this paper we have derived a closed system of equations for the interaction of 
a GLM flow with the slowly varying envelope of a WKB field of internal gravity 
waves (IGW) by incorporating the two approximate descriptions into Hamilton’s 
principle. Building on the work of Gjaja and Holm [5], we have shown that this 
approach boosts the canonical equations for the WKB IGW into the reference frame 
of the Lagrangian mean transport velocity, u^, satisfying the Euler-Boussinesq 
equations on the left-hand side of Eq. (2.26). Thus, GLM expresses WMFI as WKB 
wave motion boosted into the reference frame of the Euler-Boussinesq equations 
satisfied by the Lagrangian mean transport velocity, u^, and its corresponding 
pressure, po, and density, o. The dependence of the wave Lagrangian a*Ly on 
the fluid variables D and p implies from its variation in po that incompressibility 
of the Lagrangian mean transport velocity u^ does continue to hold for the order 
O (o) asymptotic expansion treated here. 

We have further demonstrated how stochasticity in the fluid can permeate through 
both the wave and mean flow dynamics, and that such terms can be included 
through the variational structure. Moreover, this paper has identified the nested 
semidirect-product Lie-Poisson structure possessed by the Hamiltonian formulation 
of the GLM WMFI equations. The continued preservation of the fundamental Lie 
algebraic structure for the Hamiltonian formulation of the stochastic GLM WMFI 
system implies that its data calibration and uncertainty quantification can still 
be treated systematically using the stochastic advection by Lie transport (SALT) 
approach [8]. Future work will focus next on deriving a 2D vertical slice model 
for these 3D GLM WMFI equations and developing data calibration methods for 
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the 2D vertical slice model, in order to investigate the inclusion of mean internal 
gravity wave effects on the responses of the stochastic Eady problem. 
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Appendix: Asymptotic Expansion 


This appendix fills in details of the derivations of the approximations discussed in 
Sect. 2. Namely, the displacement of a fluid element from its mean trajectory is 
represented by 


X, = x; + aé(x, t), (A.1) 


and the associated velocity is given by 


d 
U,(X,) =u! (x, 1)-ca (8G. thtuk - Vx, £(x;, )) =u" (x, ta F(x. D). 
(A.2) 


The fluctuating terms are assumed to have a WKB structure, lending the pressure an 
associated slow/fast decomposition 


E(x, t) =a(ex, ete! t XDE 4 oF (ex, ete WROD | (A.3) 


P(X, 1) =polX, +) a! (bj (€X, eee Xe pi (EX, ete UMMM) | 
jzl 
(AA) 


Making these approximations within a fluid governed by the Euler-Boussinesq 
equations may be performed by substituting them into Hamilton’s principle, 
asymptotically expanding, and truncating to leave only the leading order terms. The 
relevant variational principle in this case is as follows 


ti 1 
o= f l 2 (510? +U- xX- sez) pa - DaX di. 
t M 
° (2.6 revisited) 
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We first note that the volume form must be written in terms of the mean basis, as 

2QX)d?x = 95 (x)d?X = DË (x) J dx =: Dd?x, (A.5) 
where 


ag! 
BÀ — det (v; + 23 : 


Similarly, o also transforms as 
oX) = oF (x) —: p. 


Before calculating the terms featuring U, note that 


da ; 0d ; 
HEX, t) tuk - Vx, &(x;, t) = e——e ^ 4 ai 99 oie 
det det 
+e af gre ea" ae gible 
det det 
+ eel?f* uL . Vexa + iaei? ul -Vexo 
4r ee! PI uL. . Vexa” — ia*e™ tul - Vero (A.6) 
. a ; ə 
— iei ( 99 4 ul. Vexo | + ia*e7'?/€ np oh. Vexó 
det det 


9a 9a* 
+ eens (ss + ul k vae) + gels E + u^ i Vaa") 


= —igael?/* + iõa*e i’ + Ole), 


where we define © := -Le = -20 + u- . Vexo and k := Vex¢ as in (2.9) 
and (2.8). The may now calculate the energy terms, beginning with kinetic energy, 
making use of the above relation and taking the mean.? Note that the following 
relations are true within the Lagrangian, but are expressed here in isolation. 


1 
LJU]? 
lu 


1 
FUE + 0 (a8 (wi, D) + wh» Vx, EG. 0)" 


1 
= jl -F o?|8,& (xy, t) + UË - Vx, £x, DI? 


1 " 
jl f + 20° 0a - a* + O(a?€). 


3 In taking the mean within the action integral, we discard the terms multiplied by rapid oscillations 
exp(+iġ/€) and exp(+2i¢/e). These non-resonant terms are assumed to oscillate to zero under 
the time integral. 
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The rotation term and potential energy are 


U.QxX— (u: + (E(x), t) + u! - Vx, E(X, )) -Q x (x taé(x, 1)) 
=u". Q x x u^. 9 x (a£) +. o(0,£ +u! - Vy £) -Q xx 
+ o (9,& + ul - Vy, &) - Q x (o£) 
=u’ -Qxx+a7(,é +u} - Vx, E) Q xE 
=u} -Q x x 1 2io?9Q . (a x a*) + O(a’), 
goZ = gp(z + a£) = gpz. 


Within the pressure term, we need to take care of expansion in both p5 and J. We 
have 


(1— Dy pi PX — CF — D)pi ax = v - Dp) dx 
Dealing with the terms separately, we have the expanded expression for pë 


apo o? 3? po 3 
: E: + (o? 
OX; 5 2 ðxiðxj 5j qna 


p*(x) = po(x, r) +a 


ae? ab; 


2 OEXj OEXE 


. ab; 
ijb; Lg ; 33]. 
* Doe ( eS ema mem + &&y + O(a ) 
ap o??? 0° " 

i I O .C. 
je 9 3x A )) eec 


ij 
exp ( (e (ex, et) --a€ 


apo o? 8? po 3 
; £40 
Ax; RE, 0x;üxj ssi en 


; ab; 
(bj LE; + Od? 
+ oa CLE (o >) 


j<1 


x exp (=) (1 + iia & + ow) + c.c 


= poca 


a? 9?po 
2 OX; OX ; 


0 
= pota a &iéj 
OX; 


ð 
s) 
EX; 


j ob 
+ exp (+) c +e ee + biia? a 
t 


2i 
+ exp (ens c.c + O(a), 
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which gives the phase averaged expression 
2 


a 8? po 
P PT (aia; + a7a)) 


db ð ab; 0 
+a? (a tibia; 2 a; —ibřa;i 3 + O(a). 
L 


O€X; dexi O€X; 


Note that 


98; 98; 98; Ej 3 
= det ( ài; cq 28i; — o) 
p4 (sj ea) Felge ij vr "LEE (o) 


ooh. 95; i (= d& — 0&3 0&| " 0&3 0£; Ə, 0&i 


0x1 0x2 0X3 0X1 0x3 0x2 0X2 0x2 


0& 0&3 — 0& =) +003) 


0x3 0x1 0x3 Oxo 


ig (E: oig ae) exp (ig /e) + (= ‘at 2) exp ( iere) 
Ox; € xj əxi € OX; 


+ a? (26;; — 1) (zz + lai z) exp (iġ/€) + ec) 


j 
x (= + ta) exp (iġ/€) + cc) + O(a’). 
Oxi e` Xi 


Taking the phase average gives 


= aa 430) [98% uuo 
F =1407(28;; o((# LUNES alee Lege) eee) eoe 
j i i 


ð 
=1+i t. a (a x a*) + O(a), 


where the last equality uses the fact that we are operating under a spatial integral 
and integration by parts applies. Then, we have 


a? 9?^po 
2 ðxiðxj 


j ob ə 
+ exp (2) c + are hi + biia? Ma &) 
i i 


2i 
+exp( s 7b, + c.c 


dPo 
pe Tp RE &£; 
t 
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98; 2985; ðpo tep (2 $ 98; 
Ter bi — 
P a n e) ax; 
ə 
+ pow? (26;; = 1) 5 Per 4t c.c 4 O(o?). 
OX; OX; 


Applying phase averaging gives 
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We may assemble these statements into the following action integral, which may be 
regarded as an approximation of (2.6). 
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(A.7) 
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We now seek to simplify this integral. Firstly, we note that the following relation- 
ships hold for the last four terms on the second row of Eq. (A.7) 
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after integration by parts. We have thus far involved several of the order o? terms on 
the third line of (A.7). The remainder of these are handled as follows 
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Assembling this back into the action integral (A.7) finally yields the expression for 
S in (2.7), 


ti 1 
s= | J D| Zuti? ret? — pgz +u! -Q x x+ i'i (a x a”) 
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32 
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Quentin Jamet, Etienne Mémin, Franck Dumas, Long Li, and Pierre Garreau 


Abstract Current climate models are known to systematically overestimate the rate 
of deep water formation at high latitudes in response to too deep and too frequent 
deep convection events. We propose in this study to investigate a misrepresentation 
of deep convection in Hydrostatic Primitive Equation (HPE) ocean and climate 
models due to the lack of constraints on vertical dynamics. We discuss the potential 
of the Location Uncertainty (LU) stochastic representation of geophysical flow 
dynamics to help in the process of re-introducing some degree of non-hydrostatic 
physics in HPE models through a pressure correction method. We then test our ideas 
with idealized Large Eddy Simulations (LES) of buoyancy driven free convection 
with the CROCO modeling platform. Preliminary results at LES resolution exhibit 
a solution obtained with our Quasi-nonhydrostatic (Q-NH) model that tends toward 
the reference non-hydrostatic (NH) model. As compared to a pure hydrostatic 
setting, our Q-NH solution exhibits vertical convective plumes with larger horizontal 
structure, a better spatial organization and a reduced intensity of their associated 
vertical velocities. The simulated Mixed Layer Depth (MLD) deepening rate is 
however too slow in our Q-NH experiment as compared to the reference NH, a 
behaviour that opposes to that of hydrostatic experiments of producing too fast MLD 
deepening rate. These preliminary results are encouraging, and support future efforts 
in the direction of enriching coarse resolution, hydrostatic ocean and climate models 
with a stochastic representation of non-hydrostatic physics. 
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1 Introduction 


Deep ocean convection is a crucial mechanism for large scale ocean circulation and 
climate. It controls the rate of deep ocean water masses formation, sequestrating 
atmospheric properties such as heat and carbon in the abyssal ocean. In the North 
Atlantic basin, deep ocean convection in the Labrador Sea and the Nordic Seas 
is part of the large scale Atlantic Meridional Overturning Circulation (AMOC), 
an oceanic metric with many climate implications (Zhang et al. 2019). Coarse 
resolution (i.e. Ax ^ O(100) km) climate models are known to overestimate the 
rate of deep water formation at high latitudes in response to too deep and too 
frequent convective events (Heuzé 2017; 2021), a bias that is expected to worsen 
with next generation climate models with ocean components at higher resolution 
(Masson-Delmotte et al. 2021). Among other possibilities (e.g. preconditioning, 
air-sea interactions), we explore in this paper the possible misrepresentation of 
deep ocean convection in current climate models in response to their hydrostatic 
formulation. 

Ocean modulus of current climate models solve the Hydrostatic Primitive 
Equations (HPE), a simplified version of the full Navier-Stokes equations (NS). 
Geophysical fluids have specific characteristics that allow some approximations 
from the general NS, leading to drastic simplifications in their numerical imple- 
mentation which, in turn, allow us to model the global ocean at climate scales (i.e. 
for several decades/centuries) with the currently available computational resources. 
Among those approximations is the hydrostatic balance which arises from the 
relatively thin thickness of the ocean (H ~ O(1) km) as compared to the horizontal 
extension of its large scale dynamics (£ ~ O(1000) km for gyres and £ ~ 
O (10 — 100) km for ocean mesoscale eddies). The aspect ratio 6 = a is thus 
orders of magnitude smaller than unity. Scaling the vertical velocity W = ôU, 
with U the typical horizontal velocity of the flow, leads to small contribution of 
vertical acceleration as compared to horizontal components. For a regime satisfying 
such a scaling, only vertical pressure gradients are able to balance gravitational 
acceleration in the vertical component of the NS equations, and the system can be 
simplified as: 


1 

SEN ES Oe a (1a) 
0 
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where u = (u, v, w) is the three-dimensional velocity field, f = 2X2sin(0) is the 


traditional Coriolis pseudo-force, p is pressure, b — wow is the buoyancy defined 
for Boussinesq fluids (i.e. when density p is replaced by its constant value po, 
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unless multiplied by gravity in which case it is expressed as density anomaly £ 1 2), 
and F and D are forcing and dissipative processes, respectively. Equations (1a)- 
(1c) are the HPE momentum equations used in current climate models. From a 
numerical viewpoint, using HPE instead of general NS or other Non-Hydrostatic 
(NH) sets of equation greatly simplifies the procedure as only (1a) and (1b) have 
to be stepped forward in time for each discretized ocean layers, while (1c) is 
used (diagnostically) to obtain the pressure field through vertical integration of 
density variations subject to gravitational acceleration. In HPE models, convection 
is part of the parameterized (i.e. unresolved) three-dimensional turbulence and 
mixing processes which are encapsulated in D, ,. Usually, these operators are 
formulated with a down-gradient approach, where the vertical fluxes of a scalar 
0 are parameterized as w’0’ = — Kọ 8,0, with 0 the local, resolved field. Several 
models can be used to estimate the dissipation coefficient Kg (e.g. TKE (Gaspar 
et al. 1990), GLS (Umlauf and Burchard 2003), KPP (Large et al. 1994)), but in case 
of convection, this coefficient is usually set to an unrealistically large value (0.1 to 
10 m?s-!) to quickly restore static instabilities associated with convective processes 
and avoid model instabilities. More recently, Giordani et al. (2020) proposed an 
oceanic application of the eddy-diffusivity mass-flux formulation initially derived 
by the atmospheric community (e.g., Hourdin et al. 2006, Suselj et al. 2019), 
which allows a better representation of vertical advective fluxes associated with 
convection. The approximations leading to HPE are likely to be satisfied in most of 
the ocean where vertical velocities are small and their spatial patterns are of small 
scales. However, for the case of deep ocean convection where vertical velocities 
can reach W ~ O(10 cm s7!) and over horizontal scales of £ ^ O(1 km), such 
approximations become questionable. In case such approximations turn out to be 
violated, it becomes necessary to find ways of re-introducing some form of non- 
hydrostasy within HPE. 

Klingbeil and Burchard (2013) have proposed a direct implementation of full 
non-hydrostatic effects into an HPE model through a pressure correction method. 
Instead of solving the full three-dimentional velocity field equations 


* 1 
uer ENN EM a as Eh (2a) 
0 
1 
aju+V-(uv)t+ fu ge es (2b) 
0 
= 1 
drw + V + (uw) fu= à; p — b + Fw + Dw, (2c) 
E po ~~ 
where T = 22cos(@) is the non-traditional Coriolis pseudo-force (shown for 


consistency but not considered in the following), and non-hydrostatic contributions 
are shown in blue. To avoid the complexity of solving a three dimensional Poisson 
equation to recover the non-hydrostatic pressure (as usually done in NH pressure 
correction methods, e.g. Marshall et al. 1997) Klingbeil and Burchard (2013) 
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proposed to account for non-hydrostatic pressure correction through a vertical 
integration of a so-called non-hydrostatic buoyancy, i.e. following the strategy of 
HPE models. This strategy offers a general implementation of NH physics in HPE, 
but still suffers from numerical instabilities in the case of strongly non-hydrostatic 
dynamics. For the case of deep ocean convection, it can be shown that further 
simplifications can be made by only accounting for the horizontal viscosity acting 
on the vertical velocities in the computation of the NH pressure correction (through 
vertical integration ; Pierre Garreau, personal communication). As will be shown 
later through the analysis of different idealized Large Eddy Simulations (LES), 
HPE models tend to produce convective plumes near the grid size of the model, 
leading to unstructured (on the horizontal) convective cells. From one grid point to 
the next, vertical velocities could be of opposite sign leading to intense horizontal 
gradients. Including a horizontal viscous operator on the HPE vertical velocities 
(we recall here that in HPE vertical velocities are diagnosed from the horizontal 
velocity field though continuity) leads to a broadening of the convective plumes 
and a more realistic horizontal organization. In other words, when convective 
plumes start to form, they ‘entrain’ the neighboring points thus communicating 
horizontally their vertical momentum. Such a process can be seen as a simplified 
entrainment/detrainment mechanism discussed by Giordani et al. (2020) for the case 
of edddy-diffusivity mass-flux parameterization. In the present study, we consider 
the approach of Klingbeil and Burchard (2013) as a starting point and discuss a 
strategy to extend this idea in the context of a stochastic parameterization. The 
results presented here are all obtained at LES resolution, such that a clear connection 
with climate scale regimes is still lacking. However, these results provide a first step 
toward the development of robust stochastic parameterization for climate models, 
which will be the subject of dedicated studies. 

The paper is organized as follow. In Sect.2 we briefly recall the Location 
Uncertainty (LU; Mémin 2014, Bauer et al. 2020, Resseguier et al. 2017) frame- 
work used to represent the inertial and dissipative effects on vertical momentum 
(underlined terms in (2c)) as a result of a strong noise regime or for application 
to flow dynamics where the hydrostatic approximation becomes questionable. 
Section 3 is dedicated to the numerical implementation of the stochastic, non- 
hydrostatic pressure correction into the terrain-following Coastal and Regional 
Ocean Community (CROCO), along with the description of the simulations we have 
conducted. Preliminary results are described and discussed in Sect. 4. We summarize 
our paper and provide some perspectives for further work in Sect. 5. 


2 Stochastic Formulation of Direct Non-hydrostatic Pressure 
Correction 


Following Mémin (2014), the stochastic version of the horizontal momentum 
equation (in vector notation) reads: 
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1 
Dun + fk x (undt + cdBP) = eet oe (3) 
0 
with uj = (u, v, 0), D; the stochastic transport operator defined as: 
. 1 
Du; = drun + (u*dt + od B,) V uj — gv * (aV uj)dt, (4) 
with u* the incompressible (i.e. V - o dB; = 0) modified advection defined as: 
j ly (5) 
u*=u— -V.a 
2 


where u = (up, w) is the three dimensional velocity field, odB, represents the 
stochastic flow and a its associated variance tensor. The term IV - à can be 
interpreted as an equivalent of the Stokes drift for an inhomogeneous random fast 
component o dB, (Bauer et al. 2020). 

The introduction of the stochastic pressure dp? in (3) requires some discussion. 
This stochastic pressure is associated with the small scale velocity component 
encoded through the noise. Following Resseguier et al. (2017), for smooth-in-time 
momentum equation subject to a classical deterministic large scale momentum 
equation, its (three dimensional) gradient can be expressed as: 


1 
— —Vdp? = (odB,) -V u +f x odB, (6) 
po 


such that its interpretation (and scalling) should be related to the processes the 
stochastic formulation aims at representing. In the context of large scale modelling 
parameterization such as Tucciarone et al. (2023), the stochastic Primitive Equations 
they derived is meant to represent the effects of meso (and potentially submeso) 
scale eddies onto the large scale gyre circulation. The usual hydrostatic arguments 
are thus used, such that the vertical gradient of the stochastic pressure is identically 
zero (i.e. 0;dp? = 0) and its horizontal gradient is strictly balanced by the 
stochastic Coriolis pseudo-force. Here, we are interested in relaxing the hydrostatic 
approximation on the noise structure, but retaining it for the smooth-in-time, 
resolved flow, and derive the stochastic equation for the vertical momentum. As a 
first step in this direction, we will not include the contribution of the non-traditional 
Coriolis pseudo-force. After some manipulations, we obtain the following equation 
for the vertical momentum: 


1 1 1 1 
(—=V -a)- Vwdt —=V - (aVw)dt + odB, - Vw = (——8,p — b)dt — —à,dp?, 

2 2 — po po 
(7) 


Black terms in (7) are associated with hydrostatic physics and terms in blue are 
the different stochastic contributions that emerge when applying non-hydrostatic 
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thinkings on the stochastic noise. The left-hand side terms corresponds to the 
vertical acceleration with a scaling such that the noise vertical acceleration is 
strong compared to the large-scale vertical acceleration terms. Note that the two 
underlined terms in (7) are Brownian terms emerging from the stochastic pressure 
formulation (6) on the right-hand side and from the vertical velocity transport by 
the noise on the left-hand side. The two other blue terms on the LHS are associated 
with modified advection and dissipation (projected on the vertical velocity w) that 
emerged through the three dimensional generalization of (4): 


1 1 
D,u = drun + u V uj + d adt +odB,) -V u — zy -(aVu)dt, (8) 


where the material derivative of vertical velocities associated with the resolved flow 
(i.e. d;w + u - Vw) has been neglected. 

The noise being given (and calibrated from data or a known relation), from (7), it 
is thus possible to compute the various Brownian terms on the LHS, then to integrate 
vertically the results to obtain a 3D map of the modified pressure field as a result of 
the noise transport. Separating safely the martingale part (Brownian terms) from the 
smooth-in-time components (“d?” terms), we have 


n 
dp? (z) = dp? |z=n + w | (cdB, - Vw)dz', (9) 


for the martingale component, and 


p(z)dt = p\z=ndt + pog(n — z)dt 


n 1 1 
+o f bdt — (Gv a): Vudt + 7 V -(a¥ wd) dz, (10) 
z 


———— 
bNH 


for the smooth-in-time component. The three last terms on the RHS of (10) can be 
compared to the deterministic non-hydrostatic pressure correction of Klingbeil and 
Burchard (2013), although the material derivative of w associated with resolved flow 
is not included in our stochastic formulation. It can be noted that our formulation 
involves a 3D diffusion of the vertical velocity ensuing from the noise action 
as well as the contribution of the modified Ito-Stokes term arising from the 
spatial inhomogeneity of the noise. Both (9) and (10) should be integrated with 
appropriate boundary conditions at 7 to incorporate fast and smooth-in-time surface 
pressure contributions (such as surface waves or atmospheric pressure loading, 
respectively), but such contributions can be neglected at first approximation without 
loss of generality. Results of (9) and (10) can then be used to feedback onto the 
horizontal momentum equation (3) solved by an hydrostatic model. Assuming a 
strict separation of the martingale part and the smooth-in-time component, only (10) 


Stochastic Parameterization for Oceanic Deep Convection 149 


is assumed to feedback onto the resolved flow. The martingale components are 
assumed to balance each other, thus not affecting the resolved flow. This assumption 
can be interpreted as a Large Eddy Simulation (LES) -like approach, as discussed 
by Bauer et al. (2020). 

As a preliminary step, we will further simplify the structure of the variance tensor 
a in order to reduce the second and third terms on the RHS of (10) to a simple 
Laplacian viscosity—induced here by the noise contribution. This simplification 
is motivated in the following. In the LU framework, the strength of the noise is 
measured by its (one-point co-) variance, such that 


a(x, t) $ G(x, x,t), (11) 


with q(x, x, t) a matrix kernel defined as 
jo x. à [ ocx nex nl ax" (12) 
Q 


with ð (-, -, f) a bounded matrix kernel defining the correlation deterministic integral 
operator o; : L?(2) > L?(Q) 


TE Í, S(x,y.) fdy, Vf e CQ). (13) 


(See, e.g. Bauer et al. 2020, Mémin 2014, Resseguier et al. 2017, for further 
details). Although the previous definition of the noise is general, it is possible, 
through the Mercer's theorem, to express the noise variance as a spectral decom- 
position of the form: 


a(x, t) = Y m Opn (x, 097 (x. t), (14) 


neN 


where $, (x, t) define an orthonormal eigenfunction basis of the correlation oper- 
ator, o;, with A,(f) their corresponding eigenvalues. For a stationary noise, this 
reduces to a classical POD (or EOF) decomposition, in which the eigenfunctions 
are the solution of the eigenvalue problem 


/ K(x, x')b,(x')dx! = A4, (x) (15) 
Q 


with K the two-point correlation tensor. 

The next step is to assume isotropy and homogeneity of the noise structure, in 
which case the Fourier modes @, = &?"/** are a natural choice to satisfy (15), 
which implies (Berkooz et al. 1993): 


K - b s mL (16) 


n 
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Under isotropic condition, the variance of the divergence-free noise is constant and 
diagonal, such that the first term associated with by g in (10) is identically zero, and 
the noise induced dissipation reduces to: 


1 
gV:(aVw) = vAw, (17) 


with v the isotropic, homogeneous noise induced momentum dissipation. Through 
vertical integration of (17), we recover part of the non-hydrostatic pressure correc- 
tion proposed by Klingbeil and Burchard (2013), which in the present case mimic 
entrainement/detrainement of convective plumes leading to changes in their spatial 
organization. This modified HPE will be termed Quasi-Nonhydrostatic (Q-NH), 
by analogy with the Quasi-Hydrostatic (QH) of Marshall et al. (1997) where non- 
traditional Coriolis terms are added into the HPE. 


3 Numerical Implementation and Simulations 


Our objective is to implement this stochastic, non-hydrostatic pressure correction 
in the hydrostatic kernel of the Coastal and Regional Ocean Community model 
(CROCO ; http://www.croco-ocean.org). CROCO is a new ocean model that builds 
upon the structure of the ROMS-AGRIF primitive equation solver (Shchepetkin 
and McWilliams 2005, Debreu et al. 2012). The non-hydrostatic, non-Boussinesq 
(NQB ; Auclair et al. 2018) capabilities of CROCO will also be used to construct a 
reference simulation for validation (see Table 1). We review in the following some 
important steps for the implementation of the stochastic pressure correction within 
CROCO, discuss their implications and how we treat the pressure correction within 
the hydrostatic CROCO kernel. 


3.1 Stochastic, Non-hydrostatic Pressure Correction 


In its hydrostatic mode, CROCO computes “, from which horizontal gradients 
directly feed the baroclinic horizontal momentum equation (i.e. (2a) and (2b)). Our 
strategy is to include the NH pressure correction via a modified density/buoyancy 
field, such that the pressure field becomes: 


= n (p — po) — b 
p) E Plz=n +f (o — po) NH V az, (18) 
PO PO z PO 


with by gj collecting the different contributions of the vertical momentum equation 
(w. trends) contributing in the pressure correction, normalized by gravity: 
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1 
bNH —— 5 w_trends, (19) 
g 


where the horizontal dissipation of vertical velocity (e.g. Eq.(17)) is computed 
along sigma coordinates. Our strategy is similar to Delorme et al. (2021), who 
derived a Quasi-Hydrostatic version of CROCO by including the non-traditional 
Coriolis effects through buoyancy correction. 

Let us note that by y is abusively denoted through a buoyancy variable, however 
it corresponds to corrections brought by the noise to the usual hydrostatic pressure. 
Such a correction should not be interpreted as an actual modification of the 
density/buoyancy field of the stratified ocean. Thus, the stochastic contribution is 
not included in the specific treatment of baroclinic-barotropic mode coupling of 
CROCO, which aims at accounting for the non-uniform density field for the prop- 
agation of gravity waves (Gill 1982), ultimately reducing the usual mode-coupling 
error associated with mode-splitting schemes (Shchepetkin and McWilliams 2005). 
In other words, we do not expect this ‘non-hydrostatic buoyancy’ to affect gravity 
wave’s propagation. 

Finally, CROCO uses a third-order predictor-corrector (LF-AM3) time-stepping 
scheme for tracers and baroclinic momentum. This scheme consists of a Leapfrog 
(LF) predictor with 3rd-order Adams-Moulton (AM) interpolation. It also uses split- 
explicit techniques to robustly couple the slow, baroclinic and the fast, barotropic 
modes associated with the time evolving non-linear free surface. A complete 
description of the several stages of CROCO time-stepping can be found in Section 5 
of Shchepetkin and McWilliams (2005). This predictor-corrector, split-explicit 
scheme implies some tendency terms of the baroclinic mode are computed twice 
to step forward baroclinic momentum and tracer equations from time step t to time 
step t + At. The first computation is performed at the prediction stage, and the 
second computation is performed at the correction stage. These tendencies include 
pressure gradients. To avoid double counting the stochastic pressure correction and 
for stability reasons, the modified non-hydrostatic buoyancy is computed only at the 
correction stage. 


3.2 Numerical Experiments 


The numerical experiments we used to test our stochastic non-hydrostatic pressure 
correction are oceanic deep convection events. The configuration is inspired by 
free convection studies (e.g. Souza et al. 2020) where an horizontally uniform 
surface cooling is applied to a constantly stratified, horizontally uniform ocean. 
Although our interest is on deep ocean convection rather than mixed layer free 
convection, we have adopted an horizontally uniform setting (as usually done in 
free convection) instead of a horizontally structured system as proposed earlier 
by, e.g. Marshall and Schott (1999), where surface cooling is confined within a 
specified region (usually a disc). This setting allows the analysis of interacting 
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Fig. 1 Illustration of the 3D 7 
structure of the simulation 4009 m 200 m 
after three days of simulation 


in a non-hydrostatic setting. 
The color shading is 
temperature 


W 0001 


convective plumes with non-convective environment. However, such configurations 
are usually conducted at coarser resolution and oriented toward process studies 
of the geostrophic organization of convective plumes. Here, our focus is on 
parameterization and we wish to start with simplified settings in order to capture the 
essence of deep convection dynamics; interactions with a prescribed background, 
non-convective environment is left for further work. 

From this horizontally uniform and vertically constant stratified initial condition, 
the model is stepped forward in time on a 100x 100x 100 discretized grid points 
with isotropic resolution of 10 meters, and exposed to a constant (in time and space) 
cooling rate of Qr = —500 W m^? heat flux (Fig. 1). This leads to a cooling of 
upper ocean layers, which ultimately become unstable through static instabilities as 
a result of a negative buoyancy frequency (N? = —£8;b) « 0, thus undergoing 


convection. The current settings are run with no Coriolis forcing, ie. f = 0 s7!; 
inclusion of Coriolis effects will be the subject of further work. The model is 
initialized with stochastic perturbations on ocean upper layers temperature decaying 
with depth (following Souza et al. 2020) to trigger the formation of convective 
plumes 


10 


Tæ y Do TG) Y (eme), 1) Vo? C/E (20) 
(m,n)-0 
with T(z) = T|;2o — œz, T|.—-o = 3 K and o is a constant defined as a = AM 


(ar = 0.2048 K7! is the thermal coefficient, g = 9.81 m s~? is gravity and oo = 
1024 kg m^? is reference density). The second term on the RHS of (20) is the 
stochastic perturbation defined as the sum of plane waves with random phase m,n 
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Table 1 Summary of the experiments and their numerical details. NBQ stands for the non- 
hydrostatic, non-Boussinesq CROCO kernel of Auclair et al. (2018); NH, Hydro and Q-NH stand 
for Non-Hydrostatic, Hydrostatic and Quasi-Nonhydrostatic; WENOS and C4 for the 5-th order 
and the 4th-order centred advection schemes; KPP for the K-Profil Parameterization of Large et al. 
(1994) 


Name Kernel Hz adv Vert. adv Closure (Ax, Az) At Vw 


NH NBQ WENOS |WENOS |- (10, 10m) |2.5s  |(implicit) 
Hydro Hydro WENOS C4 KPP (10, 10m) 25s B 
Q-NH |Hydro |WENOS |C4 KPP (10,10m) /|25s |I1n?s^! 


and of amplitude VV « Vo2, where M is a Gaussian white noise distribution and 
o? = 107? K? represents the variance of the stochastic perturbations. The random 
phases are drawn from an uniform distribution over the range [0, 1]. 

This configuration has been integrated forward in time to produce several 
numerical experiments in order to assess the performance of our implementation. 
It includes a pure Non-Hydrostatic (NH), which make use of the non-hydrostatic 
non-Boussinesq capabilities of CROCO (NBQ, Auclair et al. 2018), and a pure 
hydrostatic (Hydro) reference experiments. We then compare the solution produced 
by our Quasi-Nonhydrostatic (Q-NH) experiment, which includes the stochastic 
pressure correction, with the solutions produced by Hydro and NH. 

For both Q-NH and Hydro, the KPP (Large et al. 1994) closure scheme is used 
to represent vertical sub-grid scale mixing. As stated in introduction, this scheme 
mimics vertical fluxes through a dissipative down-gradient operator. In the case of 
static instability associated with convective events, the dissipation coefficient is set 
to Kg = 0.1 m’s~! in CROCO. Sensitivity tests (not shown) using TKE (Gaspar 
et al. 1990) closure scheme instead revealed that the choice of the closure scheme 
has little effect on the solution produced by our Hydro experiment. 

Horizontal dissipation of vertical velocity in Q-NH is set to vy = 1 m?s7!, 
which corresponds to high values of dissipation estimated through a Smagorinsky- 
like approach Vsmago = $42, (xw)? + (dyw)”), with Axy = 10 m the horizontal 
resolution of our configuration, and a = 0.2. Table 1 summarizes the different 
experiments, along with some numerical details. 

Finally, note that all three experiments are conducted at the same isotropic 
resolution of 10m. Evaluating the performance of our Q-NH model for climate scale 
regimes (i.e. with horizontal resolution much coarser than vertical resolution) will 
be the subject of further work. 


4 Results 


We show on Fig. 2 snapshots of the vertical velocities as simulated by NH, Hydro 
and Q-NH after 3 days of simulation. Obviously, the NH experiment produces 
weaker and larger scale structures as compared to the two other experiments. It 
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Fig. 2 Horizontal (top) and vertical (bottom) sections of vertical velocities for the Non- 
Hydrostatic (NH, left), the Hydrostatic (Hydro, center) and the Quasi-Nonhydrostatic (Q-NH, 
right) run. Snapshots are shown after 3 days of simulation with all other components (forcing, 
dissipation, stratification, resolution) held constant 


is noticeable, however, that the amplitude of w in Q-NH is reduced as compared 
to Hydro, with larger scale structures of the convective plumes. In the Hydro 
experiment, plumes are localized near the grid scale and exhibit almost an order of 
magnitude larger vertical velocities as compared to the NH reference. The reduced 
vertical velocities in Q-NH and broadening of the associate spatial scale of the 
plumes can be interpreted as a result of the entrainment/detrainment mechanism, 
which is here simply represented as a purely horizontal viscous stress on vertical 
velocities. 

Aside from vertical velocities, it is also instructive to analyse the consequence for 
the temperature profile, an indication of the capability of convection in producing 
deep water masses. Figure3 shows the horizontally averaged temperature vertical 
profile, along with the rate of the Mixed Layer Depth (MLD) deepening, for 
the different experiments. A first remarkable result is the similarity between the 
temperature profiles produced by all the experiments within the MLD (ie.z < 
—300 m). Additional tests (not shown) indeed reveal the very weak sensitivity to 
numerical implementation (i.e. non-hydrosatic, hydrostatic with different vertical 
mixing schemes), as well as the level of dissipation. The most noticeable differences 
between the experiments appear at the base of the mixed layer. In particular, the 
consequences of too strong vertical velocities in Hydro is to produce too deep water 
masses with a too strong penetrative convection (i.e. the envelop at the base of 
the mixed layer where water masses are warmer than their initial state). Although 
vertical velocities in Q-NH remain significantly larger than those produced by NH, 
the effects of horizontal viscous forces on the vertical velocities is to significantly 
damp the penetration of convective plumes bellow the mixed layer, inducing a 
strong reduction of water masses formation. The reduction of penetrative convection 
induces significant biases in the deepening rate of the MLD (Fig.3, right panel). 
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Fig. 3 Horizontally averaged temperature profiles (left) and deepening rate of the mixed layer 
depth (MLD, right) for the different runs at the end of the 3-day long simulations. Deepening 
MLD rate are compared to the analytical estimates of Marshall and Schott (1999) and Souza et al. 
(2020) 


In Q-NH, MLD seats between the theoretical predictions of Marshall and Schott 
(1999) and that of Souza et al. (2020), where the former do not consider penetrative 
convection in their scaling while the latter do. That MLD is too shallow in Q-NH, 
as compared to NH, is likely a consequence of too strong dissipation imposed to the 
system. We note, however, that we have only considered the dissipative, rectification 
contribution (i.e. smooth-in-time) of the stochastic transport as a result of the strict 
separation assumption between martingale and smooth-in-time components. Further 
work are required to evaluate how the Brownian part of the stochastic transport 
impact the deepening of the MLD. 


5 Conclusion and Perspectives 


In this study, we detailed the first steps toward a full stochastic parameterization of 
deep ocean convection along with their implementation in the general circulation 
model CROCO. Our preliminary results, which consist of an approximation of the 
horizontal noise structure as homogeneous and isotropic, led us to recover part of 
the derivation provided by Klingbeil and Burchard (2013) in a deterministic case. 
Our results are encouraging, and we are now in a position of extending the current 
analysis to a fully consistent stochastic framework. 

The first step in this direction will be to implement the stochastic pressure 
noise contribution which comes in pair with the idealized Laplacian horizontal 
viscosity action on vertical velocities in the context of an hydrostatic simulation of 
deep ocean convection (i.e. Eq. (9)). Following previous work of Pierre Dérian and 
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Etienne Mémin (' Hyper-viscosity' noise for transport under location uncertainty), 
we will consider a simplified expression of the stochastic transport of vertical 
velocity to construct our stochastic pressure noise. This approach is meant to obtain 
the stochastic transport associated with Laplacian or hyper-viscosity dissipation as 
usually implemented in OGCM. With these considerations, it is possible to express 
the stochastic transport of (9) as: 


odB, -V w = PN 72312163 (21) 
k 


with eg a basis, defined here as Daubechies wavelets, y, denotes independent 
normally distributed variables, and A, are the wavelet coefficients defined as: 


E (V2edriv ww; ex) (22) 


me 


with e a scaling factor controlling the ratio of variance created by the noise 
to energy dissipation. Accounting for the stochastic pressure would also require 
considerations for the Brownian components of the stochastic transport in the 
horizontal momentum advection. These steps are part of further works to achieve 
a full, consistent implementation of LU transport in CROCO. 
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Abstract In recent years, stochastic parametrizations have been ubiquitous in 
modelling uncertainty in fluid dynamics models. One source of model uncertainty 
comes from the coarse graining of the fine-scale data and is in common usage in 
computational simulations at coarser scales. In this paper, we look at two such 
stochastic parametrizations: the Stochastic Advection by Lie Transport (SALT) 
parametrization introduced by Holm (Proc A 471(2176):20140963, 19, 2015) and 
the Location Uncertainty (LU) parametrization introduced by Mémin (Geophys 
Astrophys Fluid Dyn 108(2):119-146, 2014). Whilst both parametrizations are 
available for full-scale models, we study their reduced order versions obtained 
by projecting them on a complex vector Fourier mode triad of eigenfunctions 
of the curl. Remarkably, these two parametrizations lead to the same reduced 
order model, which we term the helicity-preserving stochastic triad (HST). This 
reduced order model is then compared with an alternative model which preserves 
the energy of the system, and which is termed the energy preserving stochastic 
triad (EST). These low-dimensional models are ideal benchmark models for testing 
new Data Assimilation algorithms: they are easy to implement, exhibit diverse 
behaviours depending on the choice of the coefficients and come with natural 
physical properties such as the conservation of energy and helicity. 
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1 Introduction 


The introduction of stochasticity in fluid dynamics has recently been the subject 
of intense research effort. This approach involves using random processes to 
model, for example, unresolved scales, or to take into account neglected physical 
effects. A stochastic formulation for the fluid flow introduces a probabilistic basis 
for modelling unresolved scales. This is different from the deterministic causal 
modelling which is difficult to achieve in practice due, for instance, to unknown 
initial conditions. In addition, some phenomena such as energy backscattering are 
directly accessible as stochastic processes. Another usage of stochastic modelling 
is to generate ensembles of realizations of the model. This facilitates the analysis 
of model uncertainty quantification for different low-resolution computational 
simulations and their usage to approximate the true state of the fluid, instead of 
using a single high-resolution numerical simulation. 

Some stochastic schemes have been proposed in the literature by considering 
a variety of ad-hoc perturbations. However, a principled approach is desirable. 
The formulation of stochastic dynamical systems based on physical principles 
has recently been proposed in various settings. For a review and classification of 
approaches to stochastic parameterisation based on physical principles, see [4]. The 
present work treats two additional new approaches. The first one, called stochastic 
advection by Lie transport (SALT) relies on the variational principle for fluid 
dynamics [16]. The second one, called modelling under location uncertainty (LU) is 
derived from Newton’s principle [23]. Both frameworks introduce stochasticity into 
the Lagrangian specification of the flow field, rather than directly into the Eulerian 
frame. 

In the deterministic case, it is known that three-dimensional fluid flows may 
trigger a cascade of dynamics across multiple length and time scales. This mul- 
tiscale behavior poses considerable challenges in the computational simulation 
using standard Navier-Stokes equations (see e.g. [7], [24], [5]). When modelling 
turbulence numerically, specialised discretisation methods are needed to decompose 
the underlying partial differential equations into a very large number of ordinary 
differential equations. Alternative approaches have been introduced where the 
Navier-Stokes dynamics in the Fourier space is mimicked using a finite number of 
variables, say ul, u?, — uN . The Fourier space is divided into N shells, and each 
shell s; comprises the set of wave vectors s with magnitude |s| € (so2/, so2/* !). 
Each u’ satisfies an ODE and it represents the magnitude of the velocity field 
on a length scale of s; : ([15], [7]). The quadratic nonlinearity in the Navier- 
Stokes equations produces triads of interacting vector Fourier modes within each 
shell. Shell models involving multiple triads have had considerable success in 
modelling energy and helicity cascades, as well as modelling intermittency in 
chaotic dynamical systems [6, 10, 9]. Simplified shell models with only a few triads 
date back to the 1970s and have provided major insight into fluid modal interaction. 
Even the dynamical system representation of Euler's fluid equations on a single triad 
has been quite insightful, see e.g. [28, 29]. 
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More recently, the problem of correctly parameterizing effects of small-scale 
physical processes together with the need for probabilistic ensemble forecasting and 
uncertainty quantification has led to modern stochastic approaches in the study of 
turbulence using reduced order shell models. In this work we will explore reduced 
order models for SALT and LU models obtained by projecting onto helical basis 
functions [6, 28, 29]. These helical basis functions, defined as eigenfunctions of the 
curl operator, enable one to construct reduced order stochastic models of fluid flow 
with a simplified nonlinear interaction. As we will see, under projection onto the 
basis of helical triad modes, both LU and SALT result in the same reduced order 
model and this projected model conserves helicity, but it does not conserve energy. 
Because of this coincidence in projecting the SALT and LU models onto the helical 
basis, a second reduced order scheme with a strong energy conservation property 
inspired by [17] and known as the energy preserving stochastic triad (EST) model 
will be proposed for comparison. 

While the EST model is not of transport type, it will provide comparison between 
two different classes of stochastic dynamical systems. The two classes treated here 
are (1) the helicity preserving stochastic triad (HST) (comprising both LU and 
SALT on the helical basis) and (2) the energy preserving stochastic triad (EST) 
of [17] projected onto the helical basis. The solution behaviour of the HST model 
will be compared to that for the EST model for several data assimilation objectives 
formulated on the helical triad modes. For classical deterministic models one obtains 
a system of ordinary differential equations. However, for stochastic dynamics a set 
of stochastic differential equations (SDEs) is obtained [8, 14, 25]. 

The goal of the data assimilation procedure in this context is twofold: firstly, 
it is used to calibrate the uncertainty of the model (the amplitude of the noise). 
Secondly, once the calibration is complete, the particle filtering methodology can 
be used to reduce the uncertainty. We want the distribution of the fluctuations to 
be properly approximated. In the absence of stochasticity, all particles would go 
in the same direction and the initial spread would rapidly disappear because of the 
hyperbolic character of the model. In the absence of a reasonable spread, the particle 
filter methodology will eventually collapse. For this reason, we need to introduce 
stochasticity into the system that correctly characterises the fluctuation dynamics. In 
particular, we want to find the type of noise amplitude and the stochastic parameters 
for which the distribution of the output samples is reasonably uniform. 


Structure of the Paper In Sect. 2 we introduce the triad models for incompressible 
flows modelled by the Euler equation in its deterministic and stochastic form. To 
this end, we introduce the stochastic parametrisation paradigms. Building upon 
these models for the 3D Euler equation, we then present reduced order triad models 
derived from the original equations. The derivation follows the classical approach 
for triad models from the literature, that has been successfully employed in the deter- 
ministic case. Our full derivations, complete also for the stochastically parametrised 
models, can be found in Appendix 2. The Data Assimilation experiments are carried 
out in Sect. 3. We first briefly explain the standard particle filter methodology, and 
then in Sect. 3.1 we present the findings of our numerical studies. In particular, 
Sect. 3.1 presents the results of the main numerical studies in this work. These are: 
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The model realisations of the stochastic models which we propose in this work 
are presented in Sect. 3.1.3 for different realisations of the noise. Further, we 
numerically confirm here the physical conservation properties of the models that 
motivated their theoretical design. 

The model statistics for a large number of stochastic realisations are presented 
in Sect. 3.1.4. Crucially, the evolution of the model statistics in time reveals 
differences between the two stochastic models on the level of the individual triad 
energies which go beyond the conservation properties. Moreover, we show here 
the (in-)stability of either model with respect to large noise parameters. 

In Sect. 3.1.5 we exhibit the results of the data assimilation performed using 
the transition kernels derived from the theoretical models developed in this 
paper. We show here that using the stochastic transition kernels associated with 
our proposed stochastic models improves the particle filtering procedure and 
produces efficient ensemble evolutions that are well-suited for data assimilation 
purposes. 


In Sect. 4 we describe our conclusions on this topic. We conclude the paper 


with a number of appendices: in Appendix 1 one can find a list of notations and 
standard identities, in Appendix 2 we present a detailed derivation of shell models 
(deterministic and stochastic), in Appendix 3 we introduce some supplementary 


numerics related to the noise amplitude calibration. 


Code Availability The code corresponding to the numerical experiments 
in this paper is archived in [22]. The GitHub repository is located at 


https://github.com/alobbe/stochastic-triads. 


2 Reduced Order Models for Incompressible Fluids 


2.1 Reduced Order Models for the 3D Euler Equation 


The 3D Euler equations model incompressible inviscid fluid dynamics. These 
equations may be written by using the Leray operator P to project onto the 


divergence-free part of its operand as 


Ov — 
ot — 


ôE êC 
=7(= x x) 


1 
with conserved Energy E(v) = Í zY - vd? x 
R3 


P(v x curly) 


1 
and conserved helicity C(v) = P] Í v - curl v d?x 
R3 


where 5/dv represents variational derivative with respect to the fluid velocity v. 


(1) 
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Following [9, 10] we use a Galerkin expansion in orthogonal vector modes that 
are eigenfunctions of the curl operator. Assume the fluid is contained in a periodic 
box D c R? of side length L > 0. Then the velocity v and vorticity o := V x v 
may be expanded in circularly polarised or helical modes h+ (k) exp; k- x), with the 
wave vectors k € K :— (2z:/L)Z?. The modes shall be orthogonal, i.e. 


[E hs, (p) expGp-x)-[h, (q) expGq-x)]* dx = Cópqó,.,,: C > const. (2) 


The complex vector amplitudes h+ (k) should satisfy k-h4 (k) = 0 and ik xh+ (k) = 
+|k|h+ (k). A convenient choice of basis for the h4 (k) is then given by 


ha(k):=vx«tiv, with «:=k/k, v:=kxT/|kxT|, T :— const, 
(3) 


for which |h.. (k)|? := h4 (k) - h4 (k)* = 2 and h4 (k) - hz (k)* = 0. 

At this point, one notices the key features of the helical modes h+ (k) exp(ik - x) 
which greatly simplifies analysis of modal expansions of the 3D Euler and related 
equations, such as 3D Navier-Stokes. Namely, the helical modes h+ (k) expGk - x), 
are divergence-free eigenfunctions of the curl operator. Specifically, 


V.h,ü)e/** = ik. h, (k)e!** = 0 (4) 
and 
V x h,()e/** = ik x hy (k)e'** = s|k[h, (kKe**. (5) 


See [9, 10, 28, 29] for more information about how this Galerkin decomposition into 
divergence-free eigenfunctions of the curl are used as a standard tool in analysis of 
3D solution behaviour of the deterministic Euler fluid equations and Navier-Stokes 
fluid equations. In particular, the helical mode expansions in Eqs. (6) comprise the 
source of the popular shell models as finite-dimensional expansions of turbulent 
fluid dynamics. Thus, this expansion provides a useful framework for studying low- 
dimension stochastic models of 3D Navier-Stokes turbulence. 

In terms of the basis of helical modes h+ (k) exp(ik- x), the divergence-free fluid 
velocity v(x, t) and vorticity w(x, t) are expressed in [28, 29] in terms of complex 
vector amplitudes u(k, t), w (k, t) € C3, respectively, 


v(x, 1) = Y up, )e'?* = 37 Y? as, (p. Dhs, (p)eP*, 


Pp p sp=3 


(6) 
ex, r) :2 3 wq, Det := 3 $ slal as (a, Dhs Me™. 
q 


q Sgt 
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Here, the choice 


u(k, t) :— a4 (k, thy (k) + a- (k, )h_(k) = Y 7 as (k, Dh; (k), (7) 


AY oa 


with a? (k) = a;(—k) [29], was made, so that (5) implies 


w (k, t) := IKI (a, (k, t)h4(k) — a_(k, t)h (k)) = LDD Sk s, (K, t)hs, (K). 


spear 
(8) 


The conservation laws for the Euler fluid kinetic energy and helicity—expressed as 
integrals over the spatially periodic box D—can be evaluated in Fourier space via 
Parseval’s theorem, as follows, 


1 1 " " 
an |v(x, DI dx 5 uk, t) - u*(k, n=) a ds, (k, t) a5 (k, t), 


k k == 


I v(x, t) - curlv(x, t) d?x = X uk, t) - w*(k, f) 
D k 


=J} ksz as, (k, tJa} (k, t) hs (k) - hš (k) 


k spo 


=>. 2- ksx ds, (k, t)až (k, t). 


k sp 


(9) 


Expanding the terms of the Euler equation in curl form (57), we obtain the Euler 
equations for the coefficients as, (k, t). For all k € K, sg € {+, —} we have 


1 
Bak )——: J> P jGplpl-sslaDa;, (p. Day, (q, Dh; (P)xhy, (q)-hy, (8). 
p+q+k=0 $p.5g 
(10) 


For the explicit derivation of Eq. (10) see section "Deterministic Euler" in 
Appendix 2. 

The elementary interactions in Fourier space take place between triads of wave 
vectors such that k + p+ q = 0, as exemplified in Eq. (10) above. There are two 
degrees of freedom per wave vector, (a+, a—), so eight different types of interaction 
are allowed according to the value of the triplet (sk, Sp, 5;) =  (El,-El, 1). 
Consider a fixed triple of wave vectors k, pq € K such that k + p +q = 
O and a fixed triple Sk, 55, s; € {+,—}. This gives rise to three coefficients 
ds, (k, t), as, (p, t), As, (q, f), which we compactly summarise into the complex 
vector 
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3 
a= (as, , asp» ds,) eC. 


The dynamics of a is determined by the three equations obtained from (10) 


da;, 

di = gGplpl — salalas,a5, + R, (11) 
das, 

di = 864ldl — sea; a5, + R, (12) 
das, Ox 

dr = 8 (sells — splpDas a5, + R, 13) 


We pick out a summand and cycle through k, p, q 


da 
dr = 86plpl — slaa; a5, . (14) 
das, 
dp 7 8G«lal — sk|kDas as, . (045) 
das, 
dt => = g(sy|k| mE sp|pDa;, 45, Sp? (16) 
with the constant complex scalar 
1 * * 
8 = —7h,, P x h; (q) - hy, (k). (17) 


The equations corresponding to the single triad interaction of type (sx, Sp, Sq) with 
k + p + q = 0 thus have the complex vector form also derived in [28], 


da 
dt 


= ga* x Da* = g(a x Da)*, (18) 


with the constant diagonal matrix 


D :— diag(silkl, sp |p|, sglal). (19) 


The form of the factor g defined in (17) above can be calculated from (3) to show that 
it depends on the shape and the orientation of the wave-vector triad, but not on its 
scale; since the real and imaginary parts of the complex helical vector amplitudes h;, 
are unit vectors. Moreover, Da can be seen to represent the (sk, Sp, Sq) components 
of the vorticity vector amplitude w through Eq. (8) above. Two conservation laws 
for real-valued triad energy and helicity follow immediately from Eq. (18), as 


d d 
—(a-a*)=0 and —(a-Da*)=0. (20) 
dt dt 
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The dynamical system in Eq. (18) is similar to rigid body dynamics, but replaced by 
complex angular momentum (a), complex angular velocity (Da) and real moment 
of inertia I = D~! relating the two complex quantities. ! 


2.2 Stochastic Parametrizations for the 3D Euler Equation 


In the following we introduce a reduced order model for the stochastic parametriza- 
tions introduced through the Stochastic Advection by Lie Transport paradigm as 
well as the Location Uncertainty paradigm. We explain below the rationale of these 
parametrizations: 


2.2.1 Modelling Under the Stochastic Advection by Lie Transport 
Principle 


The SALT equations were derived in [16] using a Stratonovich stochastic version 
of Hamilton's variational principle [18] in combination with Kraichnan's scalar 
turbulence model based on Stratonovich stochastic Lagrangian paths [20]. The 
application of Hamilton's principle with an imposed stochastic Lie transport 
constraint implied an Euler-Poincaré equation for the fluid motion [18]. The 3D 
SALT Euler equations for divergence-free fluid velocity v(x, t) are given by 


dv + (dx; - V)v + vj V dx! =-Vdp, 


with dx; = vdt + »3 E(x) o dW! en 


I 


As discussed in [16], this motion equation yields a Kelvin-Noether circulation 
theorem for the stochastic system 


ag vdx-- d Vdp-dx=0. (22) 
cix) c(X;) 


This stochastic Kelvin circulation theorem is has the same form as that for the 
deterministic system, except that each line element of the material loop in Kelvin's 
theorem follows the Stratonovich stochastic Lagrangian path, x;. 

The real vectors £; comprise the time-independent noise amplitudes which are 
to be determined from data assimilation. The W’ are independent (uni-dimensional) 
standard Brownian motions and o denotes stochastic integration in the Stratonovich 
sense. The curl form of the SALT Euler motion equation in (21) is obtained 


! Rigid body dynamics with complex angular momentum has also been discussed previously in 
[3]. 

? An exposition of Brownian motion, stochastic calculus and the Stratonovich integral is to be 
found, for example, in the monograph [19]. 
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from (56) and given by 
dv — dx; x curlv+ V(v- dx;) = —Vdp. (23) 


The motion equation (23) and its curl yielding the SALT vorticity equation implies 
a formula for the evolution of the helicity of the flow, A, defined as 


A := Í v - curlv d?x . (24) 
D 
Upon applying the divergence theorem, one finds 
dA = -f Tl - (v - curlv) dx; + curlv dp) ds. (25) 
aD 


Thus, in a periodic 3D domain, or in an infinite 3D domain with asymptotically 
vanishing boundary conditions, the SALT motion equation in (21) or (23) preserves 
the helicity, A, defined in (24). However, a glance at the SALT motion equation 
in (23) informs us that it will not preserve the kinetic energy, since even with the 
usual fluid boundary conditions divv = 0 implies 


5 divi? = fv dx, x curlvd?x z 0. (26) 


2.2. Modeling Under the Location Uncertainty Principle 


The Location Uncertainty principle consists in decomposing the flow trajectory 
x: Q x R* — Q over a bounded domain, à C IR? 


dx, = v (x, t) dt + o (xi, t) dW; (27) 


in terms of v (x;, f), a smooth-in-time component of the (Lagrangian) velocity 
and a noise o (x+, tf) dW;, which has here to be understood in the Itó sense and 
that accounts for the unresolved processes. The Wiener process, W, is a H- 
valued (cylindrical) Brownian motion, where H is the Hilbert space of square 
integrable functions. The noise is then properly defined as the application of an 
Hilbert-Schmidt symmetric integral kernel o; f (x) = f. S ò (x, y, t) f (y) dy to 
the H-valued cylindrical Wiener process W as 


(o ,dW;) (x) = [5 (x, y, t) dW* (y) dy, (28) 
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The role of the integrable kernel ð is to impose a spatial correlation on the small- 
scale component. It leads to the covariance tensor Q 


Qij (x, y.t,8) = E| (dW, (x))! (ordWs Q7 
=ô(t— sa f Oik (X, Z, t) Ox; (Z, Y, S) dz, 
S 


of the centered Gaussian process 0,dW; ~ N (0, Qdr). The diagonal components 
of the covariance tensor per unit of time, referred to as the variance tensor, a, 
is a positive definite matrix defined as a(x, t)ô(t — t/)dt = Q(x, x, t, t^), that 
quantifies the strength of the noise and has the dimension of a viscosity in m*s~!. 
The operator Q being compact auto-adjoint positive definite operator on H, it 
admits eigenfunctions £, (-, t) with (strictly) positive eigenvalues A; (t) satisfying 
X, cN Àn (t) < +00. As a consequence, the noise and the variance tensor a can be 
expressed through the spectral representation 


o dW, (x) = YO n O En (x. 1) df (29) 
neN 

a(x,t) — 9 dn (08, 0, 08; (x, 1). (30) 
neN 


The rate of change of a volume V, of the scalar q is given by the stochastic Reynolds 
transport theorem, introduced in [23] 


af q (x,t) dx = n [Dia +q4V. [v* dt + «dW, |} (x, t) dx, (31) 
Vi V 
with the transport operator 


1 
D;q = drq + [v* dt + o, aW,]- Vq — 2v - (aVq) dt. (32) 


In this formula, the first component of the right-hand side is the increment in time at 
a fixed location of the process q, that is diq = q (x;, t + dt) — q (xi, t), playing the 
role of a derivative in time for a non differentiable process. The effective velocity v* 
is defined as 


1 
Yav- V ata, (Vo), (33) 


where the velocity component v, = IV - a results from the noise inhomogeneities. 
For incompressible homogeneous noise as considered in this work v* — v. 
Besides, the diffusion term exactly balances the noise brought by the noise. With 
Stratonovich convention and a homogeneous noise the transport operator takes a 
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simplified form similar to the material derivative: 
Dig = dig + (vdt +0; o dW;) - Vq. (34) 


For a divergence free homogeneous noise, the Euler equation, in the LU framework, 
can then be defined as: 


div + (vdt +o, odW,): Vv = —Vdp,, V-v=0, (35) 


where dp; denotes the pressure composed of a finite variation term and a martingale 
pressure term. With the Leray projection, P, this pressure term can be removed and 
we obtain, the inertial form of the Euler equation: 


dv + IP((dx, . V)v) =0, V-v=0. (36) 


2.3 Triad Model Comparison 


The reduced order model for the full-scale 3D SALT Euler and 3D LU Euler for a 
single triad interaction equation is obtained by projecting the continuous stochastic 
Euler models onto the helical modes, in the same fashion as for the deterministic 
equation (18). Therefore, we introduce an additional Stratonovich stochastic term 
into the transport velocity in (7) as 


dx, (k, £) := (a+ (k, )hy + a—(k, t)h-) dr + Y ^ (P, (kyhy + »^ (k)h-) o dW), 
i 

(37) 
where the k-dependent complex vector b(k) :— (bs, bs p bs 22 ex represents the 
time-independent noise amplitude which is to be determined from data assimilation, 
similar to the continuous stochastic models (21). Enumerating the equation for a 
single triad then yields, after rearranging using exchange symmetry in (k, p, q), the 
matrix equation 


As, 0 —dqSsqds, PSpAs, * ds, dt + bs, o dW, i 
d ds, | =8 | 4544s, 0 —kskas, ds, dt + bs, o dW; i (38) 
ds, — pSpüs, Ks as, 0 ds, dt + bs, o dW; 


Upon applying the previous steps for the deterministic case to the stochastic velocity 
in (37), the single triad interaction dynamics for the SALT case would emerge as, 
cf. Eq. (18), 


da = e(atk, dt + b(k) o aw,) x Da*. (39) 
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The details of this computation can be found in section “LU Euler” in Appendix 2 
for the 3D LU Euler model and in section “SALT Euler” in Appendix 2 for the 3D 
SALT Euler model. 

Remarkably, the HST equation for triad interaction (39) still preserves the triad 
helicity a - Da*. Hence we name this model the helicity preserving stochastic triad 
(HST) model. Note that in both equations we use as single source of noise (One 
Brownian motion drives the entire system). 

It is readily checked that the HST triad evolution (39) preserves the helicity. Let’s 
have a look at the diffusion coefficients. 


a5sgqbq — ajsppbp bpsqqa; — basppar, 
a“ x Db = a; Sk Dx —arsqqbg |, bx Dař = | bąsęķkaž — besq qq 
ai Sppby — ay skkbx DkSp Pa, — bpskkağ 
(40) 
Taking the difference 


a5 (sqq + Spp)bq — app + sqq)bp 
a* x Db — b x Da* = ag (skk + sqq)by — ag (sqq + skk)bą |- (41) 
ar (Spp + sk) bp — a; (skk + Spp)bk 


Writing p := Tr D we get 


a5 (p skk)bq az (p skk)bp 
a* x Db — b x Da* = az (o — spp)bx — af (p — sp p)bo ] (42) 
aj (p — sqq)bp — a5 (P — sqq)bx 


So that the difference term becomes 


a* x Db — b x Da* = (p Id —D)(a x b)*. (43) 


Since the projections of the LU and SALT models onto a single triad are 
indistinguishable, we introduce a different model that conserves energy on a single 
triad to enable a comparison between energy conserving and helicity conserving 
models. 


Energy-Preserving Stochastic Triad (EST) Model We introduce below a mod- 
ified version of the HST triad equation (39) that introduces stochasticity into the 
vorticity instead of into the transport velocity and thereby conserves the energy. 
This is inspired by the full-scale model introduced in [17]. The reduced model is as 
follows 


da = — ga* x D(atk, t) dt + b(k) o aw.) ; (44) 
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We call this model the energy preserving stochastic triad (EST). Written in matrix 
form Eq. (44) becomes 


* 


ds, 0 —as as, | [kax(as, dt + bs (K) o 4 W;) 
d|a,|—8| a, 0 ds, psp(as, dt + bs,(p)odW,)| . | (45 
ds, —as, as, 0 qsq (as, dt + bs, (q) o dW,) 


The exchange symmetry between the two models HST and EST in the placement 
of the noise in Eqs. (39) and (44) is apparent already in the exchange symmetry 
between velocity and vorticity in Euler’s fluid equations (1). 


Deviation from the Conservation Laws We can write the equations for the 
deviation from the conservation laws, which is present in both models. The SALT 
model deviates from the energy conservation by 


d; Eyst = gb: (Da* x a*) o dW, (46) 


whereas the LU model deviates from the helicity conservation by 


d; Hest = gDb- ( a* x a*) o dW;. (47) 


This is seen, since, to get the energy we dot the HST equation with a* and to get 
helicity we dot the EST equation with Da* and use the standard identities 


a* . (b x Da*) = b. (Da* x a*) (48) 
Ja* - (a* x Db) = Db. (Da* x a*). (49) 


Therefore, b respects the right scaling and no further scale adjustments between the 
SALT and LU noise scaling need to be performed in order to compare the models. 


3 Data Assimilation Comparison 


In this section, we perform a comparative study of the two reduced order models 
(HST and EST) introduced above by using data assimilation tools. The particular 
methodology that we make use of is that of particle filters. We will first briefly 
explain the particle filtering methodology in a generic framework: 

Let X and Z be two processes defined on a given probability space (Q, F, P). 
The process X is usually called the signal process or the truth and Z is the 
observation process. In this paper, X is the pathwise solution of a (deterministic) 
shell model. The pair of processes (X, Z) forms the basis of the nonlinear 
filtering problem which consists in finding the best approximation of the posterior 
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distribution of the signal X, given the observations Z1, Z2,..., Zi, 3 The posterior 
distribution of the signal at time t is denoted by z;. We let dy be the dimension 
of the state space and dz be the dimension of the observation space. This 
mixed continuous-discrete time framework can be embedded into a fully discrete 
framework, whereby one is interested in computing the conditional probability law 
of the signal at the time corresponding to the observation time. in other words 
one wants to compute the conditional distribution 7, of X(t,) given the data 
Z(t1), Z(t2),..., Z(t,). The process X is assumed to be a Markov process, and 
we will denote by A, its transition kernel, that is 


Kn : R& x B(R®) > [0, 1], Kn(x, B) = P(X, € B|X,, , =x) (50) 


for any Borel measurable set B € B(R^X) and x € R*. The process Z models 
noisy measurements of the truth, using the so-called observation operator # : 
RX — RZ: 


Zn = H (Xn) + Vn (51) 


where {V,}n>0 are independent identically distributed random variables that rep- 
resent the measurement noise and # is a Borel-measurable function. In this 
paper we will assume that {V„}n>0 have standard normal distributions, but the 
same methodology can be applied to more general distributions. Observations are 
incorporated into the system at assimilation times. The following recursion formula 
holds (see [2]) 


Tn = gn * Tn—1Ky (52) 


where by ‘x’ we denoted the projective product (see e.g. Definition 10.4 in [2]). 

In the following, we compare approximations of the posterior distribution of 
the signal using particle filters. These are sequential Monte Carlo methods which 
generate approximations of the posterior distribution z; using sets of particles. That 
is, they generate approximations that are (random) measures of the form 


"T £ £ 
Tn ~ A Wy Ó(x,), 
£ 


where 6 is the Dirac delta distribution, wl, w2, ... are the weights of the particles 
and ah 5 ... are their corresponding positions. Particle filters are used to make 


inferences about the signal process by using Bayes' theorem, the time-evolution 
induced by the signal X, and the observation process Z. 


3 For a mathematical introduction on the subject, see e.g. [2]. 
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In a standard particle filter, the particles evolve between assimilation times 
according to the law of the signal. As we explain below, at each assimilation time 
the observation is incorporated into the system through the likelihood function: 


g: RX >R4, gE) = gizeH (x)) such that IP(Z,edz;|X; — x) = g7' (x)dz; 
(53) 


and all particles are weighted depending on the likelihood of their corresponding 
position, given the observation. More precisely, the particle £ is given the weight 
w! = ge ” (xe). Heuristically, the particle weight measures how close the particle 
trajectory is to the signal trajectory. A selection procedure is then applied to the set 
of weighted particles. Particles with higher conditional likelihood (guided by the 
observation) have higher weights and will be multiplied, while those which have 
small likelihoods will be eliminated. For the basic particle filter, this is done by 
sampling with replacement from the population of particles, with corresponding 
probabilities proportional to their weights. 

A Monte Carlo implementation of the transition kernel of the signal may not 
always yield good approximations. In many situations one replaces the original 
transition kernel with likelihood informed importance proposals, leading to much 
better approximations. One situation when this is necessary is when the original 
process is actually deterministic (aside for the initial position which is assumed to 
be random). This is the case in our paper. 

To overcome the collapse of the particle filter when using deterministic transition 
kernels, one can use a Markov Chain Monte Carlo procedure that leaves the 
deterministic dynamics invariant. This procedure can be costly and might not always 
introduce enough spread into the sample. In this paper, we propose a different 
approach, which we illustrate numerically in Sect. 3.1.2 below. In particular, 
we propose two different transition kernels based on the physical conservation 
properties: 


* The transition kernel associated with the HST model equation (38). We will 
denote this transition kernel by HST. As we have explained above, this transition 
kernel preserves the helicity of the system. 

* The transition kernel associated with the triad model equation (45) we will denote 
this transition kernel by KEST., As we have explained above, this transition kernel 
preserves the energy of the system. 


3.1 Numerical Studies 
3.1.1 Numerical Implementation 
The models are discretised using the stochastic SSPRK3 scheme which is docu- 


mented, for example, in [11]. In our specific case, for instance, the HST model is 
discretised as 
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qi = an + g(a, x Da?) Ar + g(b x Da) AW 
g = (3/4)an + (1/4)(Qi + g((qi)* x DD At + gb x D(q1)) AW) 
Ant = (1/3)an + (2/3)(q5 + g((q5)* x D(q5)*)At + g(b x D(q5)) AW) 


where At denotes the timestep and AW the increment of the driving Brownian 
motion. Further, a, is the approximate complex vector amplitude at time tf = 
nAt. The EST model is discretised completely analogously. For the numerical 
simulations we chose the following triad throughout. We set 


k = [1,0,0], p= [0,—1,1], q—[-1.1, —1], (54) 


with parities sx = 1, sp = —1, s; = —1 and the initial value aj = Lj, 1, 1]. We 
set the parameter F = [1, 1, 1] and used a time stepsize of At = 0.0005. 


3.1.2 Data Assimilation for the Deterministic Model 


We illustrate the failure of the particle filter with deterministic transition kernel 
in Fig. 1. In this case, the particle filtering is performed for an ensemble of 
n — 25 particles evolving according to the deterministic triad dynamics. The initial 
ensemble is spread around the initial value ag of the signal according to a Gaussian 
distribution with standard deviation 1 /4/600 and, in particular, does not contain 
the true initial point. Data assimilation is performed every 10 time units and the 
Observations are taken from the modal energies of the truth with an observation 
error n distributed as n ~ V (0, C) with covariance C = diag(0.0057, 0.05?, 0.057). 
We observe that both the bias and the RMSE keep increasing with time to values 
much larger than the observation error. Moreover, the number of distinct particles 
decreases rapidly: after 30 steps, a single particle remains that is not the true particle 
since the true particle was not part of the initial cloud. The particle filter does not 
work. 


3.1.3 Reduced Order Model Realisations 


The deterministic model in Fig. 1a exhibits continually oscillating triad amplitudes. 
Plotted are the modal energies. Writing a = [ax, ap, aq] we call the real value a, ar 
the energy of mode k. Similarly for the two other modes. 

We simulate the model realisations for different noise scenarios. We simulate 
the effect of noise in each single mode. Let the noise amplitude vector be b = 
[bk, bp, bq]. Then we simulate the two models for b = [by = 0.1, bp = 0, b, = 
0], b = [by = 0, by = 0.1, b; = 0], and b = [by = 0, by = 0, b; = 0.1]. 
The trajectories of the modal energies for n — 20 realisations of the driving noise 
for each scenario are shown in Fig. 2. We also simulate the case of full noise for 
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(a) Deterministic Model (b) Ensemble for Mode p 


Was SA Fiter Determinatie 25 Particles Mode p ROSE Finer Owterrenstic 25 Partides Mode p 


Ml 


(c) Unique Particles (d) Bias (e) RMSE 


a i NUT 


Fig. 1 Deterministic Triad Model. The horizontal axis shows time. (a) Evolution of the modal 
energies (colored lines) as well as the total energy (dashed line) and helicity (dash-dotted line). The 
deterministic model exhibits continually oscillating modal energies. The simulation also confirms 
the conservation of energy and helicity. (b-e) Data assimilation for the deterministic model using 
particle filter. (b) Evolution of the energy of mode p of the signal (grey line) and evolution of the 
energy of mode p for the particle ensemble (blue lines). Noisy observations (black stars) are made 
and assimilated every 10 time units. (c) The number of unique particle positions in the filtering 
ensemble. (d) The bias of the particle ensemble wrt. the observations. (e) The RMSE of the particle 
ensemble wrt. the observations 
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Fig. 2 Model realisations for both stochastic triad models. Plotted are the modal energies (colored 
lines), total energy (black line) and helicity (grey line). The respective thick lines are the ensemble 
means, and the thin lines represent the different stochastic realisations. (a+ e) The noise coefficient 
b = [0.1, 0, 0]. (b - f) The noise coefficient b = [0,0.1, 0]. (c+g) The noise coefficient b = 
[0, 0, 0.1]. (d+h) The noise coefficient b = [0.1, 0.05, 0.01] 
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(a) EST mean (b) EST std. dev. (c) EST skew (d) EST kurtosis 
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Fig. 3 Evolution of statistical moments of the modal energies for both stochastic models in the full 
noise case. The statistics are computed pointwise in time from an ensemble of 1000 realisations up 
to a final time of 150. (a + e) Ensemble mean. (b + f) Ensemble standard deviation. (c + g) Ensemble 
skew. (d +h) Ensemble kurtosis 


the noise amplitude vector b = [by = 0.1, bp = 0.05, by = 0.01] which was 
calibrated to the data assimilation objective using the procedure explained in section 
“Calibration of the Noise Amplitude” in Appendix 3. The ensemble of n = 20 
realisations of the driving noise in the full noise case is depicted in Fig. 2d and h. 
In all cases, it can be observed that the mean energy amplitudes of the modes are 
dampened in both stochastic models. Furthermore, we can experimentally verify the 
conservation of triad energy for the EST model and the conservation of triad helicity 
for the HST model. 


3.1.4 Model Statistics 


Figure 3 shows various statistics of the generated ensembles of n = 1000 particles 
for the HST and EST triad models in the full noise case introduced above. We plot 
the ensemble mean, standard deviation, skew, and kurtosis. 

The effect of large noise coefficients is exemplified in Fig. 4. We observe that 
the HST model explodes whereas the EST model is more tolerant to large noise 
coefficients, and even in the extreme case, does not become unstable in the mean. 
The ensemble means are computed from n = 500 realisations, using the noise 
coefficient b = [0.0, 1.0, 0.0]. Moreover, to stress the EST model, we also ran the 
same experiment with a noise coefficient of b = [0.0, 10.0, 0.0] for the EST model 
alone. 

The mean ensemble for a large number of particles, n = 20,000, is shown in 
Fig. 5. We can observe that, compared to Fig. 3a and e the oscillations after time 40 
are reduced for a very large number of particles. Hence, we believe that the system 
stabilizes in the mean to stationary modal energies as the limiting effect of the noise. 
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Fig. 4 The effect of large noise coefficients on the mean. Evolution of the mean modal energies 
(colored lines), mean total energy (black), and mean helicity (grey) for the HST (a) and EST (b) 
model with noise coefficient b — [0, 1, 0]. (c) Evolution of the mean modal energies, mean total 
energy, and mean helicity for the EST model with the strong noise coefficient b = [0, 10, 0] 
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Fig. 5 Evolution of mean modal energies for a very large number of realisations for the EST (a) 
and HST (b) models. The mean is computed from 20,000 particles in the full noise case 


3.1.5 Data Assimilation 


Using the two stochastic models in the full noise case described above, we perform 
the data assimilation tests using the following framework: 

The signal process (the truth) is given by the deterministic triad model. The 
observations are the modal energies of the deterministic model, observed every 10 
time units, and perturbed by noise of the form 


n ~ N(0, C), (55) 


where the covariance matrix C € R? is chosen to be the diagonal matrix C = 
diag(0.005?, 0.05?, 0.05?). 

We use the sequential importance resampling (SIR) particle filter to assimilate 
the periodically observed signal process under the influence of observation noise. 
The particles evolve according to the stochastic triad models. Figures 6 and 7 show 
the results of filtering the ensemble of n — 100 particles of the EST and HST triad 
models, respectively. The ensembles are assessed in terms of the bias and RMSE 
statistics. We analyse the comparison details below: 
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Fig. 6 Filtering experiment for EST model using SIR particle filter. (a-c) Ensemble evolution for 
100 particles in mode k (a, red), p (b, blue), and q (c, green). The signal (grey) is the deterministic 
model and the observations (black stars) are noisy and taken and assimilated every 10 time units. 
(d-f) Bias of the filtering ensemble. (g-h) RMSE of the filtering ensemble 


Mode k 


This is the least energetic of all the modes (hence the reason why we observe it with 
the least amount of measurement noise). The cloud of particles is well placed around 
the truth even with the small sample. The bias remains small for both the HST 
and the EST versions and it reduces significantly when observations are assimilated 
more frequently (see Fig. 9 in Appendix 3) as well as when we use a large number 
(500) of particles (see Fig. 10 in Appendix 3). The RMSE remains small in all cases 
and decreases (though not substantially) when the DA step is small. 


Modes p and q 


These are the two energetic modes of the system. We used here a measurement 
noise that is one order of magnitude larger. Despite this, the results remain equally 
good. The cloud of particles provide a good envelope for the truth at all times. 
This validates the choice of the stochasticity: the uncertainty is properly modelled. 
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Fig.7 Filtering experiment for HST model using SIR particle filter. (a-c) Ensemble evolution for 
100 particles in mode k (a, red), p (b, blue), and q (c, green). The signal (grey) is the deterministic 
model and the observations (black stars) are noisy and taken and assimilated every 10 time units. 
(d-f) Bias of the filtering ensemble. (g-h) RMSE of the filtering ensemble 


For both models the bias can become very large, reaching 30% of the size of the 
oscillations for the HST model and 25% of the size of the oscillations for the EST 
model. As expected, it is drastically reduced when observations are assimilated more 
frequently. The RMSE for mode p is also large but substantially smaller for mode q. 
The addition of more intermediate DA steps or more particles has a less pronounced 
effect for the q mode. 


Remark 2 We record the Effective Sample Size (ESS) for a typical run (for both 
EST and HST) in Fig. 11. As usual, the ESS is computed just before the application 
of the resampling procedure. The ESS is seen to decay dramatically from 100 down 
to single digits numbers in most instances in time. 


Remark 3 We record the results over 10 independent runs of the filtering experi- 
ment for the EST model with 500 ensemble members in Fig. 12. More precisely, in 
graphs 12a, b and c each, we plot the mean across the 10 independent filtering runs 
together with the evolution of the signal and the individual ensemble means for each 
mode. The mean bias as well as the envelope obtained from the independent runs 
are shown in graphs 12d, e and f. The same is shown for the RMSE in graphs 12g, 
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h and i. Compared with a single run of the same experiment reported in Fig. 10 
we observe the approximations are now near perfect (the statistical error has been 
drastically reduced). 


4 Conclusions 


The introduction of stochasticity into the deterministic triad models leads to two new 
stochastic models. Stochasticity is introduced in a principled way (rather than ad- 
hoc). It starts with a full scale fluid dynamic model which is randomly perturbed. At 
the full scale, the stochastic parametrisation models the small-scale effects in fluid 
dynamics modelling. In particular, it efficiently captures the high-frequency small- 
scale dynamics and correctly correlates it with the slow, large-scale fluid motion. 
In addition, it is constrained to conserve either the helicity or the kinetic energy 
of the system. This inspires two different stochastic triad models of Euler type 
which we compare using data assimilation procedures based on particle filtering. 
The methodology we employ can be used as a benchmark when analysing new types 
of stochastic parametrisations: ours is the first study that assesses the efficiency of 
stochastic parametrisations from a data assimilation perspective. 

The introduction of stochasticity ensures that the correct spread (one that 
preserves the physical properties of the system) is introduced in the ensemble of 
particles. In its absence, the particle filter degenerates quite rapidly: after a few 
DA steps, a single particle survives the culling procedure which does offer a good 
approximation of the truth. A purely deterministic transition kernel does not work, 
generating a rapid degeneracy of the particle filter. 

The two stochastic systems (one which preserves helicity and the other one 
which preserves energy) are analysed using a standard particle filter. There is no 
need for additional procedures (such as tempering, nudging, or jittering). They 
perform equally well from the viewpoint of DA: both the RMSE and the bias are 
drastically reduced and stabilised when the noise is carefully calibrated. The two 
different stochastic kernels require different noise calibrations in order to perform 
well in similar data assimilation scenarios. This is somehow expected, given that the 
underlying stochastic parametrisations preserve different physical quantities. 


Appendix 1: Notation and Basic Identities 


Notation 


In this work we use the following notation. We write Z, R and C for the sets 
of integers, real numbers and complex numbers, respectively. Boldface letters 
denote three-dimensional complex vectors. For two complex vectors a and b with 


Comparison of Stochastic Parametrization Schemes Using Data Assimilation. . . 181 


components a; and bj, their dot product is denoted by 
3 
a-b= ai — ajbj e C. 


j=l 


This paper uses the Einstein convention of summing over repeated indices. The 
norm of the complex vector a is defined as 


la| = Va-a* = jajaj > 0, 


with the superscript symbol * denoting complex conjugation. Further, the cross 
product of two vectors a and b is given by 


a x b = (mb; — a3b2, azb; — ay b3, ayb2 — abi) € CÌ. 
The gradient of a scalar field $ : D C IR? — C ata point x € D is denoted by 
V(X) = (019). 9609, 056 (0) € C^. 


The divergence of a vector field y : D C R? — C? with components V; at a point 
X € D is defined as 


V + W(x) = day X) + 02V (x) + 03s (x) = Oj vj(x) e C 
and the curl of v at x is given by 


V x y (x) = (823 (X) — 832 (X), BBP) — 3V3), 912 (X) — 95 G9) € C. 


Vector Identities 


For three vectors a, b and c, we have the following algebraic vector identities 


a-(bxc)-—b.(ecxa)—c-(ax b), 

a x (b x ¢) = (a-c)b- (a- b)c, 

(a x b)- (ex d) = (a: c) (b - d) — (b - ¢)(a - d), 
axa=0. 


Moreover, we have the vector calculus identity 


(a: V)b + bjVaj = —a x (V x b) + V(a - b). (56) 
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Appendix 2: Derivation of Triad Models 
Deterministic Euler 


We compute the projection of the terms corresponding to the deterministic Euler 
vorticity equation in curl form: 


av vx (Vx v) + jV|v? = —Vp. (57) 


onto the helical basis. For the time-derivative we get 


| v(x, 1) “hy, e T dx = 5 ^5 dras, (p. Dh, (p) - hr, do f dil 
D D 


P Sp 
= L? X das, (k, t)hs, (k) - hž (k) 
Sp 


LI Lads, (k, t)hs, (k) ` hy, (k) 


= 2134s, (k, t). 


The vorticity term gives 
1 (v(x, t) x ex, t)) - hz (k)e™™* dx 
D 


=) M as (p. Dsqlqlas, (q, ths, (p) x hy, (q) : h7, (k) ji gra BE 
PQ 5p.5g D 


=L J, .a«.Dsjlala; (q, Oh, (p) x h; (q) : hý (k). 
p+qt+k=0 5p.5q 
(58) 


Note that we can write (58) in a form which is symmetric in p and q since, renaming 
the indices, 


$5 M. 0s,lala; (q, Dh% (p) x hi (q) - by, (k) 
p+qtk=0 55.5; 


= » Mak a, Dsplplaz, (p. hz (a) x hz, (p) - hf, 6) 
p+qt+k=0 $p.5g 


=- YO M à% (a DÐsplplaž, (p, Dh; (p) x h7 (q) - hy, (k). 
p+q+k=0 $p.5g 
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Therefore, 
ri (v(x, t) x w(x, t)) - hz (k)e "F* dx (59) 


L? 
=F 2 2.64 -splpia;, (p. ta;, (q, Dh; (p) x hy, (q) + hy, Œ). 
pt+qtk=0 5p.5q 
(60) 


Moreover, the gradient terms in (57) vanish upon expansion into helical modes. 
Thus, the Euler equation (57) in helical basis becomes 


1 


day k, =F J> 2 Gplpl—sqlalas, (p. Da; (q, Dh; (p)xby, (q)-hy, (k). 
pt+q+k=0 Sp .5q 


SALT Euler 


We expand the 3D SALT Navier-Stokes equation (23) using (6) and (37). Assume 
bs(—p) = b (p). 


1 dv(x, t) ; hy, (je dx = > 2 das, (p, t)hs, (p) : hy. (k) | e P-E) x dx 
D D 


P Sp 


= 2L3da,, (k, t) 


And 
| dx, (x, t) x ex, t) - hz (k)e "** dx 
D 


—- Y [as, (p. dt + ds, (p) o dW;]s;lqlas, (q, ths, (p) 


P.d Sp.Sq 
x hs, (q) " hy, (k) i ei (Ptq-k)-x dx 
D 


-D 5 Y [5 p. Ddi b; (p) o dWiJsqlala (q, Dh% (p) 
p+q+k=0 $p.5q 


x hy, (q) - hy, (K). 


Renaming the indices, we can write 
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DL Y) DY [ak p Ddr + bz, (p) o dWi]sqlala® (q, hf, (p) x hr, 
p+q+k=0 $p.5q 


=P $ Y [ak a. Dárzb; (Q) o aW; ]splpla;, (p. Dh% (q) x hy, (p) - hf, (k) 
p+qt+k=0 $p.$q 


--D DY [ak (a Ddo% (Q) o dW; ]splpla?, (p. Oh, (p) x h% (q) “he (k). 
pt+q+k=0 5p.5q 


(q) - hy, (k) 


Thus, we arrive at 


Í dx, (x, t) x @(x, t) - hz (k)e !** dx 
D 
L? 
: 203 [Gui (p) o dWras (q, t) — sp|plbs, (q) o dW;a;. (p. ©) 
p+q+k=0 $p.5q 
+ (as, (p. Ddts;|qlas (q, t) — as, (q, 0 dtsplpla; (p. ny i, (p) x hy (Q) - hy, (k). 


Therefore, 


1 
dyk D=; > X ala, (p) o dWraž (q. 1) — spIpIbŽ, (q) o dWraž, (p. 1)) 
p+q+k=0 5p.5q 


+ (as, (p. )dtsq|qlas, (q, t) — a5, (q, Ndtsp|plas, (p. ou, (p) x hy, (q) - hy, (K). 


LU Euler 


Written in terms of the SALT model, the LU model is 
dv + dx, - Vv + v/ V dx! — v/V(£ o dW,)/ = —V dp. 
Expanding the additional term gives 


VE o dW)/ = V Y baha (qet o qW;)! = Sighs (qh. (ge/t* o dW,. 
q q 
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Thus we get 


v/V(E o dW)! = 37 Y a, (p. hy, (piqb;, (hi, (qe ^9 * o aw, 
p.d Sp Sq 


Now projecting and renaming, we have 
f VİVE o dWj) - aX (k, hz (9e! F* dx = 
D 


=X X a% as, (p. D[bs, (q) o dW;]Gq - hz ()) (hs, (P) - hs, (q) |. eP t979 ax 
D 


p.q Sp.8q 


=L YY ah. Da; (p. DID; (a) o dW;]C-iq - hy, (K) (hz, (p) - hz, (q)) 


Sp 


p+qt+k=0 5p.5q 
= M VP as (s 05, (p. DIDS (@) o dWi] + a5, (a, LDS, (p) o dW; 
p+qt+k=0 5p.5q 


fet = (i (p + q) -h (k)) (hy, (p) - hs, (q). 


Note that, due to the triad condition p+ q+ k= 0, 


k k 
dj a9, Si es 


so that the difference term between SALT and LU vanishes in the helical projection 
and the two projected models coincide. 


Appendix 3: Supplementary Numerics 


Calibration of the Noise Amplitude 


To calibrate the noise amplitude for the data assimilation experiments, we rely 
on two forecast verification metrics. The rank histogram (or Talagrand histogram) 
and the continuous ranked probability score (CRPS). We evaluated these for both 
models on 64 different noise amplitude vectors. The metrics were recorded by 
running the data assimilation/particle filtering experiment described in the main 
text for the noise amplitude vectors b = [bx, bp, bq] resulting from all possible 
combinations of by € (0.05, 0.1, 0.2, 0.5}, bp € (0.025, 0.05, 0.1, 0.2} and by € 
(0.01, 0.02, 0.04, 0.1}. We present the mean CRPS scores for the top 5 tested noise 
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Table 1 CRPS scores (lower is better) for the five best—in terms of overall mean CRPS score— 
tested noise amplitude vectors b (first column). The mean CRPS score over the three modal 
energies is shown for the EST (second column) and HST (third column) models. The overall mean 
CRPS score for the respective noise amplitude is shown in the Mean column (fourth column). 
Finally, we provide references to the figures showing the associated rank histograms (last column) 


b EST HST Mean Histograms 
[0.05, 0.025, 0.01] 0.0282 0.0394 0.0338 Figure 8a + f 
[0.05, 0.025, 0.02] 0.0338 0.0392 0.0365 Figure 8b + g 
[0.10, 0.025, 0.02] 0.0352 0.0385 0.0368 Figure 8c +h 
[0.10, 0.05, 0.02] 0.0393 0.0356 0.0375 Figure 8d + i 
[0.10, 0.05, 0.01] 0.0363 0.0409 0.0386 Figure 8e +j 
stool felt ndm -| L anilum... alllin, j ahh... 
Basn- -snn anak. Esna as--Ennannl JL Ioun- notata I p EE 
L.. | | rum | : enm | ee | EEEE | 
(a) EST (b) EST (c) EST (d) EST (e) EST 
T T TT TT ancttllffnes athl. 
Ianaualeue- .l-- Sí cse. 4E.-s-2s-reaadail ALLA bsaotnaannn HE | 
re =! e | ; mE | š hib anl. : TIN | 
(f) HST (g) HST (h) HST (i) HST (j) HST 


Fig. 8 Rank histograms for the 5 best noise amplitude vectors in terms of CRPS score (see Table 1) 
for the EST and HST models. Each individual graph shows the rank histogram of an ensemble of 
15 particles run up to a final time of 1400, with data assimilation performed every 10 time units. 
The top histogram in each subfigure represents the ensemble for mode k, the middle histogram 
represents the ensemble for mode p and the bottom one for mode q 


amplitude vectors in Table 1. The mean is taken across both models. Based on 
this calibration, we chose the case b = [0.10, 0.05, 0.01] for the data assimilation 
experiment, as the other ones have inferior rank histograms, so we achieve a good 
balance between the visual judgment of rank histograms and the CRPS score 
(Fig. 8). 


Data Assimilation Verification 


See Figs. 9, 10, 11, and 12. 
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Fig. 9 Filtering experiment for EST model using SIR particle filter with a small data assimilation 
interval of 5 time units. (a-c) Ensemble evolution for 100 particles in mode k (a, red), p (b, blue), 
and q (c, green). The signal (grey) is the deterministic model and the observations (black stars) 
are noisy and taken and assimilated every 5 time units. (d-f) Bias of the filtering ensemble. (g-h) 
RMSE of the filtering ensemble 
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Fig. 10 Filtering experiment for EST model using SIR particle filter with a large particle ensemble 
of 500 members. (a-c) Ensemble evolution for 500 particles in mode k (a, red), p (b, blue), and q 
(c, green). The signal (grey) is the deterministic model and the observations (black stars) are noisy 
and taken and assimilated every 5 time units. (d—f) Bias of the filtering ensemble. (g-h) RMSE of 
the filtering ensemble 
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Fig. 11 Typical ESS for the filtering experiments. See Remark 2 in the main text. (a) ESS for EST 
experiment. (b) ESS for HST experiment 
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Fig. 12 Statistics of the mean ensemble over 10 independent runs of the particle filter with 500 
ensemble members for the EST model. See Remark 3 in the main text 
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Abstract Conserved quantities in geophysical flows play an important role in the 
characterisation of geophysical dynamics and aid the development of structure- 
preserving numerical methods. A significant family of conserved quantities is 
formed by the Casimirs i.e., integral conservation laws that are in the kernel of the 
underlying Poisson bracket. The Casimirs hence determine the geometric structure 
of the geophysical fluid equations among which the enstrophy is well known. Often 
Casimirs are proposed on heuristic grounds and later verified to be part of the 
kernel of the Poisson bracket. In this work, we will explicitly construct Casimirs by 
rewriting the Poisson bracket in vorticity-divergence coordinates thereby providing 
explicit construction of Casimirs for 2D geophysical flow dynamics. 


1 Introduction 


Models of geophysical flows involve a fluid with a free interface under the influence 
of gravity on a rotating domain, often forced by variations in temperature, salinity 
and density. In this paper, we will consider the thermal rotating shallow water 
equations, which is a two dimensional model that includes all of the features above. 
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The variations of temperature and salinity are collected in a single scalar field that 
we call the buoyancy. The thermal rotating shallow water equations can be derived in 
several different ways from the rotating Euler equations with stratification. The best 
known method is to assume that the domain is shallow, allowing the replacement of 
the prognostic equation for the vertical velocity by a hydrostatic pressure relation 
from which the vertical velocity can be inferred. The result is the system of inviscid 
primitive equations. Upon assuming columnar motion, e.g., motivated by the Taylor- 
Proudman theorem, meaning that the horizontal velocity field does not depend on 
the vertical coordinate, one can vertically average the primitive equations to obtain 
the thermal rotating shallow water equations. 

The focus in this contribution is with the preservation of key structures that 
characterise the governing equations and their dynamics. In geophysical fluid 
dynamics (GFD) models in 2D so-called conservation of Casimirs forms a major 
framework for the modelling and construction of methods for their numerical 
treatment. We present an explicit constructive method with which the Casimirs 
of a general class of GFD models can be explicitly computed. This provides an 
alternative to earlier work by Cotter and Holm [2013] that yields explicit Casimirs 
for 2D systems. 

This entire sequence of assumptions and approximations can be performed at 
the level of the Lagrangian formulation of the models, as shown in Holm and 
Luesink [2021]. Doing so is helpful because the geometric structure associated with 
such a variational formulation implies various important dynamical quantities and 
conservation laws. For two-dimensional geophysical fluids, we find that the key 
dynamical quantities are the potential vorticity and the potential buoyancy, where 
the latter is the ratio of the buoyancy to the depth. Both quantities play a key role in 
cyclogenesis, as shown in Holm et al. [2021]. In the present work, we will show that 
the potential vorticity and potential buoyancy can also be used to give an alternative 
formulation of the fluid equations. This formulation is particularly useful for the 
identification of Casimirs, which are conserved integral quantities. The Casimirs 
in turn help interpret mechanisms in geophysical turbulence. For instance, in the 
incompressible setting without buoyancy effects in two dimensions, the enstrophy 
is one of the Casimirs and plays a central role in the double cascade predicted by 
Kraichnan [1967]. From a geometric point of view, the Casimirs are the functionals 
that form the kernel of the Poisson bracket. Usually, the form of the Casimirs is 
assumed or guessed and then verified by checking whether the Poisson bracket 
indeed vanishes. We will use two subsequent changes of variables to provide a 
constructive derivation for the form of the Casimirs. 

The organization of this paper is as follows. In Sect. 2 we present the main 
equations relevant to 2D geophysical flow and sketch the central importance of 
the retaining symmetries and conserved properties of the models. Among these 
properties, Casimirs are a major structure for the GFD models in 2D. Explicit 
methods to determine these Casimirs will be sketched in Sect. 3. Concluding 
remarks are collected in Sect. 4. 
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In Holm and Luesink [2021] it is shown that starting from the Lagrangian for 
the rotating, stratified Euler equations in three dimensions, one can derive the 
Lagrangian for the thermal rotating shallow water equations by assuming hydro- 
static pressure and applying vertical averaging. The benefit of performing this 
sequence of approximations at the level of the Lagrangian is that the geometric 
structure is not affected. This means that the thermal rotating shallow water 
equations can be formulated using a Lie-Poisson bracket. As was shown in Marsden 
and Weinstein [1983], there exist multiple equivalent formulations in terms of Lie- 
Poisson brackets of the two-dimensional Euler equations. Similar formulations can 
be derived for the compressible two-dimensional fluid models. This was first done 
in Holm and Long [1989]. The retained Lagrangian structure in 2D is basic to a 
possible systematic approach to determining the conserved quantities associated 
with the thermal rotating shallow water equations. This systematic approach will 
be elaborated on in this paper. 

It is possible to derive the equations for geophysical fluid dynamics on arbitrary 
smooth manifolds starting from the Euler equations, as shown in Holm et al. [1998]. 
This level of generality requires tools of differential topology, such as the Lie 
derivative, the pullback and the pushforward, see for instance Abraham and Marsden 
[1978], Marsden and Ratiu [2013], Holm et al. [2009]. However, if the domain 
is a two-dimensional compact subset à embedded in R? or R? with Cartesian 
coordinates and appropriate boundary conditions, we can formulate the equations 
of motion also using vector proxies of exterior calculus. The computations are in 
arbitrary orthonormal coordinate systems. To this end, we start from the dimension- 
less Lagrangian for the thermal rotating shallow water (TRSW) equations, which 
are given by 


1 1 1 
Lreswnnd)= f (Glu? pe R ggg sb - 20) ndu. 
(1) 


In this Lagrangian, u is the velocity field, n = o£ + h is the total depth, ¢ is the 
free surface elevation, h is the bottom topography, R is the vector potential for the 
Coriolis parameter and b is the buoyancy variable. Furthermore, the dimensionless 
numbers are the aspect ratio o, the Rossby number Ro, the Froude number Fr, the 
wave amplitude o and the stratification parameter s. The stratification parameter 
represents the importance of the buoyancy variable, which itself is of order one. The 
vector potential for the Coriolis parameter satisfies V+-R = f (x, y), where f (x, y) 
is the usual Coriolis parameter. The -L operator corresponds to a Hodge star operator, 
which for Cartesian coordinates is defined by (x, y)t = (—y, x). The volume form 
is given by du, which in Cartesian coordinates is expressed as du = dx dy. The 
Lagrangian is a functional on ¥(Q) x V*(Q), the product space of the space of 
vector fields ¥(Q) and the space of advected quantities V*(Q). That is, u € X(Q2) 
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and y, b € V*(Q). More information and details on these spaces can be found in 
Luesink [2021]. The thermal rotating shallow water equations associated with the 
Lagrangian (1) are given in the advective formulation by 


9 kei pe 2 Y + sb) + — (at — h)Vb 
9 Ee EUM 
ot Ro Fr? 2 Fr? : 


0 
mE : = 2 
3i n+ V- (qu) = 0, (2) 


ð 

x? t (u-V)b — O0. 
Equations (2) describe a compressible fluid with thermal effects in a rotating frame 
in two dimensions. Note that the equations take the same form on the sphere. 
The bold symbols denote vector valued quantities, which assumes the existence 
of a basis. Since we are working with two dimensional manifolds, such a basis is 
available. We will now recall an important vector calculus identity that is central 
to the manipulations that we will perform. This identity follows from differential 
topology, as shown in Holm et al. [2009], where vector fields and differential forms 
in arbitrary coordinate systems can be defined. The vector-valued coefficients of 
these objects can be identified when the underlying space is Euclidean with the 
standard inner product. This means that u is simultaneously the coefficient of a 
vector field and a differential 1-form. This can lead to confusion, so in what follows 
u denotes a vector field in arbitrary coordinates and v denotes a differential 1-form. 
The Lie derivative can be defined as the derivative of the pullback 


d * 

Luv =| AEQ). 3) 
where ó is the flow associated with the vector field u. The asterisk on $ means that v 
is pulled back along $. Here, ¢ is the solution to the differential equation d$ /dt = 
u(Ó, t) with arbitrary initial condition, i.e., representing a trajectory of a test-particle 
in the flow field u. The arbitrariness is what allows Lagrangian formulations of fluid 
dynamics to be related to Eulerian formulations. The Lie derivative can also be 
defined by means of Cartan's formula 


Luv = iydv + d(i,v), (4) 


where i„ v denotes the interior product of u with v and d is the exterior derivative, see 
e.g. Abraham and Marsden [1978], Marsden and Ratiu [2013], Holm et al. [2009]. 
Setting the two definitions (3) and (4) of the Lie derivative equal to one another in 
R? provides the following vector calculus identity 


(u - V)v + (Vu) - v = (Vt - v)u* + V(u - v), (5) 
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where the left hand side results from the pullback definition (3) and the right hand 
side follows from the Cartan formula (4). 

Using (5), an alternative formulation to (2) of the thermal rotating shallow water 
equations is given by 


P (o+ x) fa Lad uta. Sb 
at Ro’ Fr 2 2Fr 
ð 
z” tV 0u) =0, 
EG Sobcsn 
Ot 
(6) 
where œ = V+ - u is the scalar vorticity. This formulation is often called the 


vector invariant formulation of fluid dynamics. There are several situations where 
the vector invariant formulation has advantages over the advective formulation. 
The vector invariant form offers an easier basis for numerical discretisations 
because it hides the nonlinearity. Additionally, on the right hand side of the 
velocity equation in (6), one recognises the gradient of the Bernoulli function 
B= ilu [^ + -2- (1-- sb)£, further explaining the physical mechanisms responsible 
for the flow of fluid. 

Both (2) and (6) can be formulated as a Hamiltonian system with a Lie- 
Poisson bracket. This is a crucial property of these models as it prepares for a 
clear formulation of conserved quantities, i.e., the desired Casimirs. By Legendre 
transforming the Lagrangian (1), we obtain the Hamiltonian associated with the 
thermal rotating shallow water equations. The Legendre transform is given by 


H (m, n, b) = (m, u) — Liu, n, b), (7) 


where m = óL/óu is the momentum variable in terms of the functional derivative 
of the Lagrangian. The Legendre transform only relates the velocity and the 
momentum and does not affect the advected quantities. The Hamiltonian is therefore 
a functional on the space ¥* (2) x V*(Q), where X*(€2) is the dual space of X(€2) 
with respect to L?-pairing on Q. For the thermal rotating shallow water model, the 
momentum is given by m = n(u + R) and the Hamiltonian is 


Ex cH 
H = = — (1 — 2h l 
TRsw(m, n, b) f (zm tme + sb) ) ndu (8) 
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The Poisson bracket that will allow us to formulate the equations of motion has the 
following general form, see e.g. Holm et al. [2009], 


F/m 8G/àm 
(F,G)- f 8F/8n | Jan, n, b) | 8G/ên | du, (9) 
2 V8F/8b 8G/ób 


where F and G are two arbitrary functionals on the space X*(Q) x V*. The matrix 
J is operator valued and depends on the state. J can be explicitly derived using 
reduction theorems, following Marsden and Weinstein [1974], Holm et al. [1998]. 
We speak of Hamiltonian dynamics if the evolution of a functional F is expressed by 


d r—-(F.H) (10) 
dt exe , , 


where H is the Hamiltonian of the system. We obtain the equations of motion (2) 
by choosing G = H and using J (m, n, b) as in Holm et al. [2021] to find 


a [mi m0; + djmj ndi —bi\ (8HrRsw/ómj 
al? |= jn 0 0 óHrnsw/ón (41) 
b bj 0 0 ó Hr gsw /ób 
a 
=J(m,n,b) 


and reproduces Eqs. (2) after using the definition of the momentum, rewriting the 
momentum equation using the continuity equation and applying the Lie derivative 
identity (5) to the vector potential of the Coriolis parameter. We have used Einstein’s 
summation convention of summing over repeated indices and the notation b; refers 
to i-th component of the gradient of b (the comma denotes a spatial derivative). Note 
that the dependence on the state (m, 7, b) is linear. This means that the Poisson 
bracket is in fact a Lie-Poisson bracket. The kernel of a Lie-Poisson bracket is 
nontrivial and is precisely the kernel of the linear operator J. The kernel is spanned 
by functionals known as Casimirs and the goal of the remainder of this work is to 
derive these Casimir functionals explicitly. 

In Holm and Long [1989], the momentum bracket (11) is transformed by 
means of a change of variables to the rotational form. The change of variables is 
(mi, m3, n, b) +> (u,v, n, b), where u = ™ and v = 72, which is invertible as 
long as 0 < n < oo. This change of variables is not the same as just formulating 
the Hamiltonian and the Lie-Poisson bracket in terms of velocity because of the role 
of the rotating frame. Performing the transformation of Holm and Long [1989], we 
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obtain the following formulation 


u 0 —4 dx -7 ôHrrsw/ôu = nu 
9 vy _ q 0 dy Ez ôHrrsw/ôv = qv (12) 
ðt | m 8,9, 0 0 ôHrrsw/ôn = B |" 

b E z 0 0 ôHrrsw/ôb = T 


=J(u,v,n,b) 


where g = 7 (w+ gs f) is the potential vorticity, B = |u|? + zz +sb)(n—2h) 
is the Bernoulli function and T = sen? — 2nh). This Hamiltonian formulation 
corresponds to (6) and extends the bracket of Holm and Long [1989] to include 
thermal variations. This bracket is linear and it is a simple exercise to show that it is 
skew-symmetric with respect to the L?-pairing. To prove that it satisfies the Jacobi 
identity, we repeat the argument of Holm and Long [1989] and state that the bracket 
in (12) is an invertible transformation of variables in the Lie-Poisson bracket (11). 

We now wish to express the bracket (12) in another set of variables to achieve 
a formulation in independent scalar variables. This is accomplished using the 
transformation (u, v, y, b) œ> (œ, D, n, nb), where w = V+. u is the vorticity and 
D = V - wis the divergence. If the domain Q has no holes, then the vector Laplace 
equation has a unique solution. This is the Helmholtz theorem for Euclidean space. 
If the domain does have holes (or islands) then appropriate boundary conditions 
are required to eliminate harmonic functions in order to guarantee uniqueness 
of solutions. In either case, provided with appropriate boundary conditions when 
necessary, one can uniquely split up the vector field u into potentials via the 
Helmholtz decomposition 


u = Vy + Vx, (13) 
where y is the stream function and x is the velocity potential. Uniqueness of 


solutions to the Laplace equation is required to reconstruct u, since the potentials 
satisfy Poisson’s equations 


(14) 


This permits us to write the velocity field in terms of vorticity and divergence as 
follows 


u=VtA!w+VA7'!D, (15) 
which means that changing variables in the Hamiltonian amounts to substitut- 


ing (15) for the velocity. The first term in (15) is analogous to the two-dimensional 
version of the Biot-Savart law, but since the fluid is compressible it does not 
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determine the full velocity field. The potential buoyancy variable r = b/n 
is introduced to remove the fraction terms +b,/n and +by/n in the Poisson 
bracket (12). This is convenient from a notation point of view, but also means that 
all the variables that appear inside the J matrix in the Poisson bracket are "potential 
variables", i.e., potential vorticity and potential buoyancy. To change variables in 
the Cartesian coordinate case, we sandwich the Poisson bracket of Eq. (12) with the 
functional Jacobian derivative operator and its L?-adjoint as follows 


by 
—a, a, 00\ (9 —49x —7X (à, —8,00 


à a,00||«a 0 3,-2||—-3, -3 00 do 
0 010|[3,3,0 0 0 0 10 
0 001 be h 0 0 0 0 01 


Similar manipulations can be performed for different coordinate systems, provided 
that one takes an orthonormal coordinate basis. For the sphere this would be a 
latitude-longitude basis. Performing this change of coordinates in the bracket (12) 
leads to 


àF/8o qx —qOrx 8G/ào 
8FJ/8D qqxlr $G/8D 
F,G| =- V. V du. (7 
| ji 8F/ôn 0 -10 0 8G/ôn | ^f qn 
8FJ8b rx -r00 8G/ôb 
——— 
=J (w, D,n,b) 


This bracket formulation has a number of interesting properties. First of all, the 
differential operators can be factored out of the matrix upon introducing the 
convention xV = V+. Secondly, the matrix only features the potential vorticity 
q and the potential buoyancy r. The kernel of this bracket is the kernel of the 
matrix operator J(w, D, n, b), which is skew-symmetric in both q and r. The most 
interesting property in our opinion is that this bracket governs a number of two- 
dimensional fluid models through submatrices of J. 

The full four by four setting corresponds to the thermal rotating shallow water 
equations. If there is no underlying rotation, one simply adapts the definition of 
the potential vorticity q to obtain the thermal shallow water equations. When there 
are no buoyancy variations, the bracket can be restricted to the three by three case, 
which corresponds to the rotating shallow water equations. Again, upon adapting 
the potential vorticity variable g, one can obtain the shallow water equations in 
case rotation is absent. In the incompressible case where buoyancy variations still 
play a role, the divergence is zero. Then one can use the submatrix consisting of 
the (1, 1), (1, 4), (4, 1), (4, 4) elements, which corresponds to the thermal rotating 
Euler equations. If buoyancy variations do not play a role, the matrix is simply the 
(1, 1) element, which can describe the two-dimensional rotating Euler equations 
and the quasi-geostrophic (QG) equations. Important to note is that the thermal 
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Table 1 The models that can be described by submatrices of the Poisson bracket (17) 


Incompressible Compressible 
1. Euler 3. Shallow water 
2. Rotating Euler 6. Rotating shallow water 
3. QG T. Thermal shallow water 
4. Thermal rotating Euler 8. Thermal rotating shallow water 


QG (TQG) model described in Warneford and Dellar [2013], Zeitlin [2018], Holm 
et al. [2021] does not fit into the Lie-Poisson bracket formulation because its (1, 1) 
position features q — b rather than just q, see Holm et al. [2021]. This is a result of 
the fact that the TQG model is derived as a perturbation around thermal geostrophic 
balance, rather than around geostrophic balance. To summarise, we list all models 
that can be described by (17) in order of complexity in Table 1. 

The transition from incompressible models to compressible models is a steep 
increase in complexity, since the compressible models involve an additional two 
equations compared to the incompressible case. The divergence and the depth 
variable are always paired together, since changes in D imply changes in 7, and vice- 
versa. This bracket is particularly useful in the explicit computation of Casimirs, 
which is shown in Sect. 3. It also has other uses. In ocean dynamics, gravity 
waves propagate at speeds that are orders of magnitude higher than the typical 
flow velocity. If the Rossby number, the Froude number, the wave amplitude and 
the stratification parameter satisfy O(Ro) = O(Fr) = O(a) = O(s), then one 
can derive the thermal geostrophic balance condition. This condition provides an 
algebraic relation for the balanced velocity field. The balanced velocity field is 
divergence free. Since the bracket (17) features the divergence variable explicitly, 
it is natural to perform an asymptotic expansion in a small parameter e where 


w = wo + €o1 + o(e), 


D = é Dı + o(6). 


(18) 


The expansion (18) applied to the thermal rotating shallow water Lagrangian (1) 
and truncated at order o(1) yields the thermal extension of the L1 model of Salmon 
[1983]. This derivation is shown in detail in Holm et al. [2021]. 


3 Explicitly Determining the Casimirs 


The bracket (17) is particularly helpful in the explicit computation of the Casimirs 
since the variables are all independent scalars. We are looking for functionals 
C(o, D, n, b) such that (F, C) = 0 for any functional F. The following compu- 
tations follow a procedure of step-by-step elimination. Expanding the bracket and 
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requiring ( F, C} = 0 for any F yields the equation 


SF re 8C 8C 
Üele =- | -wlgwo— -qV — rv 
Bye) o ôw (« jo pm 3 


ôF ôC ôC ôC ôC 
+ —V-|qV—-+qV +V +rv 
w n ôb 


(19) 


Since F is an arbitrary functional, we can solve for this functional equation per 
variational derivative of F and get the explicit form of the Casimirs by a process of 
elimination. In fact, the third line of (19), the one that features ôF /ônņ, will be our 
starting point in the explicit computations. The third line implies that C may be at 
most linear in D with constant coefficients, since only then the variational derivative 
of C with respect to D is constant in space. Applying the gradient leads to zero, 
meaning that the third term vanishes under the assumption of linear dependence 
of C on mass density D. If the variational derivative of C with respect to D is 
not constant with respect to space, then this term does not vanish, as AD is not 
necessarily zero. So at this stage we know that 


C(a, Ds. = f y D 4t f(o,n,b)dy, (20) 
Q 


where y € R is a constant and f (c, 7, b) is a function that is determined next. Since 
we have established that C must be linear in D, all terms that involve variational 
derivatives of C with respect to D vanish. Simplifying (19) accordingly, we obtain 


SF 8C 8C 
0={F,C}=- | V Hl ev ar = 
uae) NUT (a $5 7 e) 


ôF ôC C ôC 
+—V-(qV—+V—+rv (21) 
óc n ôb 


The third line of (21), the one that features ôF /ôb, vanishes if C depends linearly on 
w. In this case we do not have to insist on constant coefficients. If the coefficient of w 
is an arbitrary differentiable function V (r) of the potential buoyancy, i.e., 5C/da@ = 
V (r), we have 


v(r m) = V.(rv^w(r) = w'oyvr-vir-r(v.v^wver)-0, Q2 
w 
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because Vr, Vr and V, V+ are orthogonal. Here V'(r) denotes the derivative of 
V with respect to its argument. Recall that the potential buoyancy is defined as 
r = b/n. At this stage, we know that C must have the form 


C(o, D, »b- f y D - oV(r) - g(b,n) du. (23) 
Q 


The next step is to determine g(b, n). Knowing the explicit form of the variational 
derivative of C with respect to c, we can simplify (21) further to obtain 


8F 8C 
o- (os - f —V-.[gvw(r 4 rvt — 
Q ôw ôb 
(24) 
piy vu( 4 von 4 vit d 
—V-e r — +rv— . 
sp. M án mp 


We focus on the term that features ôF /ôœw. We have an explicitly constructed term 
and the variational derivative of C with respect to b. By the same argument as in 
the previous step, we know V - (rV (6 C/ób)) vanishes if the variational derivative 
of C with respect to b is a differentiable function of r. So, let P(r) = (b/n) be 
this differentiable function. Since a variational derivative of ®(b/nņ) with respect 
to b produces a factor of 1/5, we introduce the term n® (b/n) into C. So at this 
stage, after the three steps, we have an expression for C in terms of two arbitrary 
differentiable functions and Y 


C(o, D, n= f n®(r)+nqv¥(r) +yDdu (25) 
Q 


where y € R is a constant and ng = œw + ae f. Going back to (24), we can verify 
whether C (w, D, n, b) is indeed the family of Casimirs of the Poisson bracket (17). 
This means that the terms multiplying variational derivatives of F must vanish. So 
we substitute (25) into (24) and compute the term multiplied by ô F/5c 


V. (ev — avn + rtm) 
=y. (aviv) +rv+(o'(r) + qV'o)) 
2$" (ryvr- Vtr) +r (V -VHP (r) (26) 
+ V'a Vir) + Y'r Vta) 
+r” (r)(Vr - Vq) - rW'(r(Vq- Vtr) 
=0. 
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In this computation we have used orthogonality of V, V+ and Vr, V+r and skew- 
symmetry, i.e., Vr-Vtg = —Vq-Vtr (= —rxqy -- qxry for Cartesian coordinates). 
It remains to check whether the term multiplied by 8 F/5D also vanishes. In the 
following computation, we suppress the dependence of ® and W on r for notational 
convenience. We compute from the second line in (19) 


v. (43€ 4 gv hl LC 4 pl 
2 PEE r 
T Gg 47^ p. au 8b 


=V: (avw + V(> — r&' — rqu') +rV(e +4’) E 


"2 (avy — rV(qV.) — qW'Vr+ rviqv’)) 


= 0. 


The computation is a sequence of applying the identity V f(r) = f'(r)Vr. In the 
first step we have applied this to all the terms that involve ®. Performing the same 
manipulations on V subsequently, implies the result. Note that in (24) we required 
cancellations of terms orthogonal to the terms that are required to cancel in (27). 
The reason that the perpendicular gradient of the variational derivative of C with 
respect to 7 does not appear in (24) is because it is trivially zero due to orthogonality 
of V and V+. Hence we can conclude that (25) is the complete description of the 
Casimirs for the bracket (17) and thus also for (11). We can repeat the computation 
for each of the models described in Table 1 to obtain the corresponding Casimirs. 
This is summarised in Table 2. Here one can see that in presence of thermal effects 
enstrophy is no longer a Casimir. 


Table 2 The Casimirs of the models that can be described by submatrices of the Poisson 
bracket (17) 


Incompressible Casimirs 

1. Euler C = fo O(g)du 

2 Rotating Euler C= fo P(g) du 

3. QG C= fo) du 

4 Thermal rotating Euler C= fo Or) - qwmr)dyu 
Compressible Casimirs 

5. Shallow water C= fond (q) - yDdyu 

6. Rotating shallow water C = fon®(q)+yDdu 

7. Thermal shallow water C= fond) - aw(r) + yDdu 

8. 


Thermal rotating shallow water C= fo ndr) 4 nqw() + yDdu 
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4 Conclusion 


In geometric approaches to fluid dynamics one often may exploit constructive 
methods to infer conservation laws via Noether’s theorem Abraham and Marsden 
[1978], Marsden and Weinstein [1983], Marsden and Ratiu [2013], Holm et al. 
[2009]. Casimirs identify conservation laws that arise as the kernel of the Poisson 
bracket. We provided an explicit method of determining these Casimir functionals 
for two-dimensional fluid dynamics by means of two changes of variables for the 
thermal rotating shallow water equations. We formulated the equations in vector 
invariant form by using an important vector calculus identity. The Poisson bracket 
corresponding to this vector invariant form is convenient for further manipulation. 
By changing coordinates from velocity to vorticity and divergence, we derive a 
Poisson bracket that only involves (skew) gradients and divergences. By means 
of this formulation, it is a systematic computation to obtain the Casimirs. The 
computations were performed for arbitrary coordinate systems, which means that 
the above computations can easily used for domains such as the sphere. 
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Correlated Structures in a Balanced 
Motion Interacting with an Internal Wave se 


Igor Maingonnat, Gilles Tissot, and Noé Lahaye 


Abstract Characterizing the loss of coherence of an internal tide propagating 
through mesoscale turbulence has been a major challenge in oceanography, particu- 
larly due to its implications for the interpretation of satellite data. In this paper, we 
intend to study the correlations between a balanced motion and the incoherent part 
of a wave in an idealised configuration. We introduce a new modal decomposition 
technique, named broad-band proper orthogonal decomposition (BBPOD), which 
consists in performing a proper orthogonal decomposition (POD) on complex 
demodulated variables. After connecting BBPOD to the standard SPOD, we show 
that BBPOD, coupled with the extended POD technique enables us to associate the 
principal components of the incoherent field to the slow flow structures responsible 
of this loss of coherence through triadic interactions with the incident wave. 


Keywords Internal tide interactions - Spectral proper orthogonal decomposition - 
Broadband proper orthogonal decomposition 


1 Introduction 


Internal tides, generated by interactions between the barotropic tide and topographic 
features such as ridges or continental slopes, are ubiquitous in the ocean, playing 
a crucial role in vertical mixing and energy transport. They propagate over large 
distances, encountering regions with energetic mesoscale turbulence, and they 
lose their fixed phase relationship with the astronomical forcing, a phenomenon 
known as incoherence. This loss of coherence, highly unpredictable, complicates 
for example our ability to disentangle internal tides and low-frequency turbulent 
signals from satellite data (Richman et al. 2012). 
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To face these difficulties, there have been various studies to better understand 
and predict the impact of a balanced (turbulent) jet on the inertia-gravity wave 
field properties. For instance, Savva and Vanneste (2018) and Ward and Dewar 
(2010) studied the internal tide scattering. Ponte and Klein (2015) examined the 
incoherence time scales for different turbulent configurations, and Dunphy et al. 
(2017) quantified the interaction terms via vertical-mode projection of the linearized 
Boussinesq equations. 

We consider in the present paper a data-driven approach to study from idealised 
numerical simulations, the structures of the jet, which are correlated with the 
incoherent contributions of the wave field. Extracting coherent structures from a 
data set can be performed in the spectral domain by spectral proper orthogonal 
decomposition (SPOD) (Towne et al. 2018). Some attempts to connect these reduced 
features with non-linear interactions have been proposed for instance in Karban 
et al. (2022) by identifying the contribution of the non-linear term correlated 
with the SPOD mode through extended proper orthogonal decomposition (EPOD, 
Boree (2003)). Unfortunately, non-linear forcing is a quantity more difficult to 
interpret and associate with physical mechanisms than a pressure or velocity field for 
instance. We propose instead a new broad-band proper orthogonal decomposition 
(BBPOD), derived from the SPOD. By taking advantage of the strong time scale 
separation between the two dynamics and considering small-amplitude wave, this 
formulation allows us to connect the non-linear interactions between the slow 
motions of the jet and the incoherent contributions of the wave through the EPOD 
method. The non-linear interactions are here understood as triadic interactions with 
the slow motion that will lead to a broadband frequency structure of the incoherent 
wave field. 

The plan is as follows. We will begin by describing the model used for our 
simulations in Sect. 2. In Sect. 3.1, the original SPOD method is reviewed and a 
connection to the proposed BBPOD method is made (Sect. 3.2). Sections 4 and 5 
will summarize the study and bring some perspectives. 


2 Model 


The propagation of internal tides through a nearly-balanced jet is examined in a one 
layer rotating shallow water (RSW) model. The equations are non-dimensionalized 
as follows. The characteristic time-scale is the inertial time T = f~!, inverse 
of the Coriolis frequency. The reference length scale L is chosen to be of the 
order of the jet thickness, taken equal to the first Rossby radius of deformation 
Ra, such that the Burger number B, = RZ /L? is equal to one. A beta-plane 
approximation, with parameter 6, accounts for the effect of rotation, and a radiative 
damping term o is added on the continuity equation, (following e.g. Brunet and 
Vautard 1996). Adequat artificial hyperviscosity, consistent with energy dissipation 
and conservation of angular momentum, is also used (see Ochoa et al. 2011). 
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Let Q be a bounded subset of R?, the equations without viscosity, defined on 
Q x R+, are 


8h + B, div v + Ro((v-V)h + h div v) = —ah (1) 
dv + (1 + By) + Ro(v-V)v = —Vh, (2) 


where v = (u, v) is the horizontal velocity, vt = (—v, u)", h is the sea surface 
height (SSH) anomaly and Ro = U/(fL) is the Rossby number. 

A numerical simulation of system (1) and (2) of a plane wave interacting with 
a zonal jet has been performed. Equations have been discretized by a pseudo- 
spectral method and with a Runge-Kutta time-scheme using the code Dedalus 
(Burns et al. 2020). The domain Q is a doubly periodic rectangular domain of size 
[0, 20] x [—20, 20] discretized on a 128 x 512 grid. The simulation is initialized 
with an eastward zonal jet at geostrophic equilibrium with a small perturbation 
to trigger the instabilities. During the whole experiment, a northward propagating 
plane wave with frequency fixed at œw = 2 is generated in a nudging layer in the 
South of the domain, and an eastward wind forcing is applied to maintain the 
balanced current. A spin-up phase allows the jet to evolve toward a statistically 
stationary state. The experiment continues with snapshots stored every 0.1 wave 
period, such that a sufficiently long series q = (u, v, h)? of 12,000 snapshots are 
collected, representing nearly 4000 wave periods. Note that a sufficient sampling in 
time is required for extracting the wave field by filtering in Sect. 3.2. An example of 
snapshot associated with one run of the simulation is shown in Fig. 1. 
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Fig. 1 A snapshot taken from the RSW simulation. On the left the vorticity field and on the right 
the internal wave SSH anomaly ha, extracted by bandpass filtering centered around the wave 
frequency w = 2 
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This simple model can sustain waves interacting with a turbulent flow. Northward 
rotating shallow water waves are inertia gravity waves satisfying the dispersion 
relation 


, o — (1+ By)’ 

joo ue. (3) 

: B, 

with ky denoting the spatial wavenumber in the meridional direction, which is 
function of the tidal frequency w. 


3 Methods 


The methods to extract correlated structures in a flow field are described in this 
section. In the following we denote our state vector by q = (u, v, h)”. 


3.1 Spectral Proper Orthogonal Decomposition 


Spectral proper orthogonal decomposition is an extension of POD methods (see 
Lumley 1967), and aims at extracting coherent structures in spectral space from 
numerical or experimental data. The data are assumed to be issued from a 
statistically stationary random process, which is verified thanks to the hypothesis 
of small-amplitude waves, such that the balanced flow is marginally impacted by 
the wave propagation. The modes obtained from this decomposition are space-time 
uncorrelated from each other. 

We interpret our state vector as a zero-mean second order random process q 
(which can be obtained by subtracting the time averaged field beforehand) indexed 
over Q x IR*, a subset of spatio-temporal variables. Consider q(w) the Fourier 
transform of the process at angular frequency w. We assume that each realization 
of q(@) belongs to Lip. C3) = {g : 2 —> C, Jo g*(z)W(z)g(z)dz < oo], the 
ensemble of square integrable functions in space (periodic at boundary) relatively 
to a positive definite weight matrix W: Q — M,,(R). The superscript -* denotes 
the transpose-conjugate operation. The matrix W is chosen such that the L?-norm 
approximates the energy of the RSW model (1) and (2): 


| 2 t f 4 aya? + v ded : h? dxd 4 
dlw = 5] Ct )(u + uv") dxdy + 75 7 y, (4) 


u 


where A refers to the deviation of the isopycnal taken equal to R,/B, and h 
corresponds to the temporal-mean of SSH anomaly. The term 1 + A/ is assumed to 
remain strictly positive (which is necessary for ensuring positiveness of the norm), 
since we do not consider zero or negative sea-surface height. 
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Let us recall briefly the basic principle of SPOD. By stationarity the auto- 
correlation function C(x, y, t, x’, y', t^) = E(q(x, y, t) @ q*(x’, y’, t’)) satisfies: 


C(x, y, tx, y, t) = C(x, y, x’, ys t =t’), (5) 


where ® denotes the dyadic product of two vectors in IR?. The objective of SPOD 
method is therefore to find a deterministic function v satisfying the Fredholm 
equation: 


ri Sera r Wl wee See, © 


where S(x, y, x’, y', œ) = [^ C(x, y, x’, y, t)e !^' dr is the cross spectral density 
matrix (CSD). 

The Karhunen-Loeve theorem (Loéve 1955) states that Vw € R, Eq. (6) has 
an infinite number of solutions (A j(@), V (@)) 1 such that: (y j ($4 is an 
orthonormal basis in ES, C?) where we can expand the Fourier transform of 
the field into structures uncorrelated from each other: 


Q(x, y, o) =) aj(o)¥ jx, y, o) (7) 


j=l 


with E(aj(@)aj(w’)) = Xj(w)8j, (o — o!) and Vj,4;(@) = 0. Moreover, a 
truncated expansion at order n will maximize the mean energy (defined by the norm 
IE lap compared to any other decomposition of the same order, and SPOD provides 
an optimal decomposition of the CSD: 


XI Go) lw) = 3 100 (8) 
j=l 
S(x, y, x^ y o) = DAO), y, WF", y^ o). (9) 


j=l 


Therefore, by solving the Fredholm equation at the tidal frequency, we are 
able to separate the fast from the slow component of the process by a spectral 
decomposition, expanded onto an orthonormal basis optimal energetically. 

It can be remarked that when there is an homogeneous direction, it is possible 
to compute the Fredholm equation in spectral space with respect to this direction. 
Each wave-number can be computed independently, and each corresponding Fourier 
component is solution of the original Fredholm problem. For more complete 
description of SPOD (see Towne et al. 2018; Schmidt and Colonius 2020). In our 
case, the domain is homogeneous in the x direction, but we still consider the 2D 
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problem for the physical relevance of the extended BBPOD problem presented in 
Sect. 3.2. 


3.2 Broadband Proper Orthogonal Decomposition 


The BBPOD consists in estimating by complex-demodulation (see Godfrey 1965) 
the correlation tensor of a band-passed-filtered signal, and its eigenfunctions. The 
associated algorithm is presented in this section. It will be compared with the 
Welch’s method used in SPOD (see Welch 1967; Towne et al. 2018) and it will 
be shown that under some hypotheses both algorithms are equivalent. It can be 
noted that in Welch (1967), the connection with complex demodulation is briefly 
mentioned, and it is leveraged here with the computation of the eigenvectors and 
eigenvalues of the correlation tensor for a POD decomposition. We highlight by this 
equivalence, that due to windowing in the Welch's method, the features extracted by 
SPOD possess a spectral component with a thick frequency band. For the Broadband 
POD algorithm, this frequency band is explicit, and chosen through the definition 
of a filter. In the RSW model, the incoherent wave field possesses a broad band 
structure and Broadband POD allows us to obtain a complete decomposition of this 
field. 


3.2.1 Complex Demodulation of the Wave Field 


Given a temporal series x;, the principle of complex demodulation consists in the 
following computation: 


xa = £(ue 19, (10) 


where xg is called the complex demodulated signal, £ is a low-pass filter and w 
is the demodulated frequency. Complex demodulation enables to extract the slowly 
varying amplitude envelop and phase deviations of the wave field. These variations 
are associated with wave incoherence. 

First, we decompose the process q(x, y,t) in a sum of q;(x,y,t) and 
Q(X, y. t), representing the jet and wave contributions, respectively. The wave 
field can be expressed as: 


qox yt = qax, y, Del", (11) 


where w is the tidal frequency and qg is a complex field slowly varying in 
time, accounting for the incoherence. The latter is then extracted by the complex 
demodulation of q (10), with a filter designed such that it isolates the slow variations 
of the background flow. As a consequence, through the e^/^' shift, the spectral- 
band of the extracted wave contains all effects of triadic interactions between the 
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slow motions and the coherent wave. The jet contribution q ; is obtained by directly 
filtering q (zero frequency), including here the time-average. In this study, we 
assume that q is dominated by the superimposition of q ; and q,,, which interact non- 
linearly. Note that for simplicity, a uniform time sampling of the data was chosen 
but this can be adapted to a non-uniform one, e.g. by interpolation on a regular grid 
or using filtering algorithms to irregular sampling. 


3.22 Link with SPOD 


For making the link with SPOD to estimate the CSD of a signal x;, we write the 
temporal filter in a discrete form, with a time spacing At and f; = kAt. A wide 
class of linear filter can be expressed as the convolution: 


Les S^ bien (12) 


i—-—m 


where (b;)_m<i<m are the discrete coefficients of the filter. Then, 


m 
(Loe); = b» byxj-ye "ti (13) 
k=—m 
j+m 
= Y bje, (14) 


k=j—m 


In the Welch method, x; is subdivided into possibly overlapping blocks of size N 
and with an overlap No. A Fast Fourier Transform is performed on each windowed 
block to extract the Fourier component at the tidal frequency, denoted X p where / 
is the block index. We define 


N 


Xi, = Y xN -No Wie Ih, (15) 
k=0 


where Wy is a window function. By changing variable k’ = k + 1(N — No), we 
obtain 


N-HGN —No) 
x = eo! lN-No) > xp We_un—nye ioe. (16) 
k’=I(N—No) 


Assuming that the window function is symmetrical in the middle of each block 
(which is verified for most windows used in the literature), i.e. Wg = Wny_x, 
Eq. (16) gives 
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N-H(N —Nso) 

x = gl li(N -No) 5 Wy-A4(N—N,)-k Xy e O., (17) 
k'=1(N—No) 

Finally, by choosing Wg = bg which sets m = X, relation (14) yields 
I i cot] (N — —iwt 
xl, = i NN-No) (p (x, eio DERA (18) 
Consequently, up to a phase, the Welch method corresponds to the computation of 


the complex demodulated signal sampled every N — No. The phase shift cancels 
when computing the correlation over Np blocks: 


Np Np 

l yl * —icot —iwt 
J XX» = ) (Lixje'® DN 41—N,) E Ore um DN FINN) (19) 
1-0 1-0 


The CSD can thus be obtained by computing the correlation over Np snapshots 
sampled every N — No of the complex demodulated signal, which is proving the 
equivalence between the two methods for appropriate numerical parameters. 

As said before, it can be remarked that the window function acts as a filter in the 
Welch procedure, but without giving an explicit expression of the frequency band. 
Moreover, if we aim at studying the whole broadband field, the classical SPOD 
(with its single frequency interpretation when the window function is designed to 
only estimate a pure harmonic) requires the computation of a set of spatial modes 
at each discrete frequency in the peak. In comparison, BBPOD computes at only 
one frequency the dominant modes of the wave field. This has the drawback for 
the SPOD algorithm to include at each frequency a few modes of nearby frequency 
due to the convolution with a finite length window, leading to misinterpretation or 
counting the same mode several times in the reconstruction of the whole wave field. 
More precisely, BBPOD gathers the first SPOD mode of each frequency providing 
a simpler representation of the whole wave field and avoids spurious modes in the 
reconstruction. Another important remark is that the connection of our method with 
SPOD lies on the fact that the spectral peak is sufficiently narrow corresponding of 
a sufficiently large window in the Welch method guaranteeing a good estimation of 
the CSD. 

The final step is to compute the eigenvalues and eigenvectors of the CSD, 
estimated by complex demodulation. The complex demodulated signal at frequency 
w is then decomposed on this BBPOD basis: 


qa(x. y. t) = ajo, Wj (x, y. o), (20) 
J 


with aj(w,t) = Jo q5G. y, 0WG, y) (x, y, e) dxdy slowly varying coeffi- 
cients. 
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3.2.3 Extended Broadband Proper Orthogonal Decomposition 


Non-linear interactions between slow variations of the background flow and the 
wave induce incoherent wave contributions through triadic interactions. A major 
interest of BBPOD is that it allows us to study the correlations between the slow 
perturbations of the jet, and the incoherence of the fast wave field, by extracting the 
slowly varying complex demodulated amplitudes. To that end, we propose to apply 
the concept of EPOD in the framework of BBPOD. EPOD, originally presented in 
Boree (2003), enables to identify the part of a target field correlated with a given 
POD mode. The target field we first consider is the balanced motion obtained by 
low-pass filtering. The p-th extended POD mode of the slow motion q/ correlated 
to the p-th BBPOD coefficient of the wave ap is defined by: 


j _ E@/ap@)) 
w= ^ (21) 


where the expectation operator is a temporal average over snapshots. The mode 
x5 will be referred to as direct EPOD, and ay (o) x» represents the part of the jet 
correlated with the p-th broadband POD coefficient of the wave. 

Complementary to direct EPOD, we define as well an inverse EPOD, applying 
BBPOD for w = 0, thus obtaining an orthonormal basis representative of the jet 
variability, and considering the complex demodulated of the wave contribution qq 
as the target field. In this situation, we identify the contribution of the incoherent 
wave field correlated with the BBPOD coefficients of the jet noted a, (xw = 0). 
Therefore, in the following direct EPOD refer to jet modes while inverse EPOD 
refer to wave modes. 


4 Results 


This section details the numerical results carried out in this work. The goal is to 
understand the non-linear interactions between the balanced motion and the wave 
field by means of the methods presented in Sect. 3, that will extract the correlations 
between the two dynamics. 

Figure 2 shows the real part of the first three energy-scaled BBPOD modes at 
the tidal frequency, sorted by decreasing energy (4). Specifically here, we plot the 
scaled quantities Jj vj to highlight their respective energy contribution, but the 
analysis was performed with the normalized modes. To define the low pass filter 
L, a fourth order Butterworth filter has been taken with a frequency cut-off equals 
to the typical frequencies of the jet, such that the whole wave-field scattered by 
the jet is captured. The first mode containing the most energy corresponds to the 
coherent wave field, that is defined here as the part of the wave correlated to the 
tidal forcing and thus phase-locked in time, since it corresponds to the time-average 
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Fig. 2 Real part of the first three energy-scaled BBPOD modes 4/A jy; associated with u at the 
tidal frequency. Sponge regions are not shown 


Table 1 Normalized energy Mode 9 QN 
of the first 5 modes shown in 
percentage. Computed in €2 1 57.35_| 29.04 
and in Q”. Each eigenvalue 2 13.81 | 12.98 
is divided by the total 3 9.86 | 9.51 
variance in Q at the tidal 4 3.86 | 3.77 
frequency 27; AF (f) 5 234 | 213 
Sum of incoherence | 42.64 | 37.91 


of the complex-demodulated signal. Other modes account then for the incoherent 
structures as space-time decorrelated of the mean. They show deflections in opposite 
direction, with mode-number mx = +2, and with meridional mode-number my 
expected to be determined by the dispersion relation (3) and the structure of the jet 
(see Bühler 2014). This is a consequence of the homogeneity in the zonal direction, 
guaranteed by our idealised set-up, which makes SPOD modes equal to Fourier 
modes in x. 

Table 1 shows the normalised energy contained in the first five BBPOD modes 
and in the incoherence field. It is computed for the domain without sponge regions 
noted Q = [0,20] x [—12, 12] and for the subset QY = [0,20] x [—2, 12] 
representing the region of incoherences. 

In the total domain, the coherent plane wave accounts for 57.35% of the energy 
and the first three modes 81% of the energy. This shows that linear effects dominate 
the wave propagation, because we are in a configuration for which incoherence 
is relatively weak. As explained in Ward and Dewar (2010), a more energetic jet 
interacting with a tide with higher frequency would result in a more energetic 
incoherent wave field for a RSW simulation. As expected, incoherent modes have 
nearly all their energy in the upper part while it is rather equally distributed for 
the coherent mode. In this region, incoherences are dominant due to non-linearities 
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Fig. 3 Cumulative energy x Aj (m)/ x Àj (@)[%] contained in the reconstruction of the 
incoherent field by BBPOD modes 


in the center of the domain, which leads to an increase of the amplitude of the 
modes with y, even though the coherent mode still contains the most energy. 
Figure 3 shows the energy contained in the reconstructed incoherent wave field 
in terms of cumulative energy. The first two incoherent modes account for 58% 
of the total incoherent energy. For 6 modes, 80% of the energy is recovered in the 
reconstruction. The cumulative energy can be understood as a normalized RMSE 
measuring the accuracy of the basis to reconstruct the true incoherent wave field q;,,. 
Elldin- X= ine V V e) 
E(lidincll7 gy) 
be remarked that spatial coherence can be computed with the coherence function y 
through the CSD. 

A SPOD decomposition was also performed in Egbert and Erofeeva (2021) for 
a realistic HYCOM simulation. Similar results were found for the energy captured 
by the SPOD modes at the M2 frequency, which is similar to our forcing frequency. 
This encourages to extend this analysis to a more realistic simulation. 

Figure 4 depicts the direct EPOD modes of the balanced motion, correlated with 
the BBPOD modes of the wave. The modes are weighted as explained before. The 
first extended mode is showing that the part of the jet correlated with the coherent 
field is the stationary mean flow, which connects the first order statistic of both 
dynamics. This results is the expression of the linear propagation of the incident 
wave through the mean flow (in time and in the x-direction). The second and third 
EPOD modes represent the meanders of the jet, consisting of a vortex train. The 


according to 


=1 E? Aj (@)/ joi Aj (0). It can 


modes are defined up to a complex constant. We can see that xi is approximately 


proportional to xi. suggesting that meandering of the jet generates eastward and 
westward propagating incoherent perturbations. 
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Fig. 4 On the left column are presented the real part of the first three BBPOD modes of u at the 
tidal frequency. On the last column are the corresponding direct EPOD modes of u. Modes are 


scaled as follows: /A jj 


The opposite procedure is done next by inverse EPOD. Figure 5 shows the 


leading three jet (energy-scaled) BBPOD modes associated with œ = 


O and the 


associated EPOD modes of the wave. As in direct EPOD, the first mode is showing 
the correlation between the mean of both dynamics. For the second and third modes, 
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Fig. 5 On the first column are presented the first three BBPOD modes of u of the jet. On the last 
column are the associated inverse EPOD modes of u. Modes are scaled as follows: J/A j Wj 


BBPOD shows jet meandering structures identical to the direct EPOD modes (up 
to a phase). However, their associated inverse EPOD indicate a standing wave 
corresponding to the sum of the two direct broadband POD modes that represent left 
and right deflections, expressing the fact that the EPOD and BBPOD association is 
not bijective. A given meander will give rise to a wave deflected eastward nearly as 
much as a wave deflected westward, and their relative contribution is not possible 
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to predict, but the present result suggests that an approximately equal repartition 
would constitute a fair estimate. 

Globally, we can infer that the most energetic triadic interactions between the 
jet and the plane wave are caused by meandering structures of the jet generating 
an incoherent field of the form of a standing wave in the zonal direction x and 
propagating northward. The rising of a standing wave suggests in average an equal 
distribution of the eastward and westward propagating modes. This gives nice 
perspectives of incorporating this methodology in the context of data assimilation to 
estimate the incoherent wave field with some knowledge of the slow balanced flow 
through triadic interactions. 


5 Summary and Perspectives 


This study investigates the correlated processes at stake within the non-linear 
interactions of a plane wave propagating through a meso-scale oceanic current. For 
that purpose, the method of broadband POD is introduced and connected to extended 
POD. A first methodological point is done to show that BBPOD is equivalent to 
the common Welch method to estimate the cross spectral density matrix, to which 
SPOD are the eigenfunctions. More importantly for our study, connecting these 
two methods allows us to understand that the most energetic features of the wave 
are obtained by triadic interactions with the most energetic features of the jet. In 
our idealised configuration, the wave is deflected in both directions by a given 
meander. The spatial periodicity of the meander sets the zonal deflections, giving 
us clearer insight on the BBPOD of the wave knowing only the extended modes 
of the jet. This method can be seen as complementary to methods based on the 
bispectrum to detect triadic interactions in the data. Here we have an a priori on 
the nature of the interactions, between a low frequency jet and a fast wave field, 
allowing to study directly their correlations and to propose a methodology well 
suited for reconstruction. In a future work, we intend to examine the sensitivity of 
the results to other jet or wave configurations, and even to richer models. Studying 
the implications of these methods to reconstruct by correlation the internal wave 
field by the knowledge of the instantaneous slow motion is also envisaged for data 
assimilation problematics, either by direct estimation, or for learning an observation 
operator associated with a Galerkin model defined in the BBPOD space. 
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Linear Wave Solutions of a Stochastic A 
Shallow Water Model Check for | 


Etienne Mémin, Long Li, Noé Lahaye, Gilles Tissot, and Bertrand Chapron 


Abstract In this paper, we investigate the wave solutions of a stochastic rotating 
shallow water model. This approximate model provides an interesting simple 
description of the interplay between waves and random forcing ensuing either from 
the wind or coming as the feedback of the ocean on the atmosphere and leading in a 
very fast way to the selection of some wavelength. This interwoven, yet simple, 
mechanism explains the emergence of typical wavelength associated with near 
inertial waves. Ensemble-mean waves that are not in phase with the random forcing 
are damped at an exponential rate, whose magnitude depends on the random forcing 
variance. Geostrophic adjustment is also interpreted as a statistical homogenization 
process in which, in order to conserve potential vorticity, the small-scale component 
tends to align to the velocity fields to form a statistically homogeneous random field. 


1 Introduction 


Oceanic global circulation currents show a predominance of near inertial waves 
(NIW) in their spectrum. These waves result from the repeated forcing of atmo- 
spheric winds over the globe together with the influence of global earth rotation 
at Coriolis frequency. Besides, large-scale numerical representations of oceanic 
circulation require to introduce subgrid scale models that inescapably damp in the 
long run velocity fields and wave solutions. In such models high frequency waves 
are obviously completely smoothed out. But even large scale structures might be 
also affected if no particular attention is paid to the subgrid model design. 
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In the last year, there has been an increasing effort to devise stochastic parameter- 
izations for large-scale flows [2, 10, 11]. The motivations come from the failure of 
classical subgrid models to represent accurately the effect of the flow state variables 
at the unresolved scales and the necessity to provide reliable and computationally 
efficient models at the climatic scale. Uncertainty quantification, ensemble methods 
for forecasting and data assimilation are also prevailing, and the Bayes principle on 
which they are built leads naturally to considering stochastic dynamics. 

Stochastic dynamics can hardly be devised on ad hoc grounds if one wants 
to provide generic and flexible systems. Control of the variance’s growth and the 
respect of the physical properties of the underlying turbulent flows is of the utmost 
importance to define stable and physically relevant systems, in which the unresolved 
variables or the different physical and numerical approximations performed are 
faithfully represented. Two main schemes have been recently proposed to that 
purpose in the literature [12, 20]. The first one is a geometric framework relying 
on a Hamiltonian formulation, whereas the second one—referred to as modelling 
under location uncertainty (LU)—is based on Newton's principles and a stochastic 
formulation of the Reynolds transport theorem. Both schemes have been analyzed 
and numerically experimented for several geophysical models and configurations 
[1, 4, 5, 6, 17, 23, 25]. Surface waves and linear models have been proposed in the 
LU framework [8, 26, 27]. 

Nevertheless, the analysis of basic geostrophic mechanisms such as geostrophic 
adjustment or the form of basic wave solutions in linearized simple systems has not 
been investigated so far. This is the objective of this work. We will in particular 
analyze a stochastic rotating shallow water model recently proposed in [4, 16]. 
The focus will be on wave solutions of this system. We will show in particular 
that ensemble-mean waves ensuing from a random forcing at given frequencies 
are preserved whereas the others are very quickly damped. The stochastic model 
will allow us to propose a very simple mechanism leading to the emergence 
and conservation of NIW in atmosphere and ocean dynamics as a coupled self 
entrainment process. 

The paper is organized as follows. In a first section we briefly recall the proposed 
stochastic model—rotating shallow water under location uncertainty (RSW-LU). 
We then identify a stationary solution of this stochastic system. We next analyze 
and illustrate the plane wave solutions associated to the linearized system under 
some specific noises. Finally, we provide a picture of the geostrophic adjustment 
process associated to the random system. 


2 Review of RSW-LU 


Let us first recall the stochastic transport operator introduced in the LU framework 
[1, 18, 17, 24]: 
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D,O :=d,O + ((w - jV -a) dt + 0,dB;) -VO- jV -(aV@) dt =0, (19) 


where the tracer, ©, is a stochastic process with an extensive property (e.g. 
temperature, salinity, buoyancy), d,O(x) := O(x,t + ôt) — O (x,t) stands for 
the time-increment of © at a fixed point x with ôt an infinitesimal time variation, 
u denotes the time-smooth resolved velocity that is both spatially and temporally 
correlated, o;dB; stands for the highly oscillating unresolved noise component, 
assumed in the present study to be divergence-free, with its quadratic variation 
[22] denoted by a and Iv * à is the so-called Itó-Stokes drift [1] ensuing from 
the inhomogeneity of the noise. The mathematical definitions of the noise and its 
quadratic variation are given by 


eB, = | a(x,» DABO dy = YA, DEG. AB, 
neN 

a(x,t) — [5e v 08 0s ny = y O(£5;)G. 0. (1c) 
neN 


where o, is an integral operator defined on the Hilbert space (L2 (D))4 with a 
bounded spatial domain D C C? (d = 2 or 3), 6 = (6j)i, j-1,...,4 is a spatially 
and temporally bounded matrix kernel of ø+, 4, and &, are the eigenfunctions and 
eigenvalues of the composite operator 0,07 (o7 denotes the adjoint of o), e! stands 
for transpose-conjugate operation, B, is a cylindrical Wiener process [22] and 7 
are independent (one-dimensional) standard Brownian motions. 

Under the stochastic transport notations (1), the governing equations of the 
energy-preserved RSW-LU system [4] read 


(Conservation of momentum) 


iu + fout dt = —gVndt, (2a) 
(Conservation of mass) 
ih RN -udt =0, (2b) 
(Incompressible constraints) 
c,dB,— V^gdB, V-V-a=0, (2c) 
(Conservation of energy) 
1 
d, Í ~ p(h|u|? + gh?) dx = 0, (2d) 
D2 
where u = [u, v]" denotes the two-dimensional horizontal velocity with ut = 


[—v, u]", V = [dx, 8,]" stands for the horizontal gradient with yt = [—dy, dx]’, 
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h(x,t) = H + n(x,t) is the water thickness with H the flat bottom height 
and 7 the free-surface position, fo is a constant approximation of the Coriolis 
parameter, g is the gravity constant, p is the water density and dB; denotes a 
random (scalar) stream function defined in a similar way as in (1b). As shown 
in [4], the incompressible conditions (2c) for both noise and Itó-Stokes drift 
ensure the path-wise conservation of the total energy (2d). We remark that the 
analytical properties for the RSW-LU system have been investigated in [16] and 
some numerical applications of (2) have been performed in [4]. 


3 Stationary Solution 


We focus now on stationary solutions associated to the previous system. To that 
end, neglecting the time increments d;u and d;h in (2a) and (2b) and recalling that 
Vn = Vh due to the flat bottom assumption, we obtain 


1 1 
((u -5V +a): Vu — 5 V -(aVu) + fout + Vh) dt-+0,dB,-Vu=0, (3a) 


1 1 
(( x a) Vh — 53V: (aVh) c V -u) dt +0,dB,;-Vh — 0, (3b) 


o,dB,; = VtodB,, a=Vtg(Vt¢)’. (3c) 


From Bichteler-Dellacherie decomposition of a semi-martingale, the martingale 
terms (i.e. the Brownian terms) and the finite variation terms (i.e. the differentiable 
terms) can be safely separated. Decomposing in such way the mass equation (3b) 
and considering the corresponding martingale part leads to o;dB, - Vh = 0. 
The random surface height gradient is aligned with the large-scale surface height 
gradient 


o dB, =aVth, (4) 
The noise being divergence free yields that o is constant along the level sets of h: 
Va. Vth =0. (5) 


It is hence necessary of the form a = ¢(h) + C, with C a constant, and ¢ a 
differentiable function. For such a noise the variance tensor is given by 


a=oVthvtnh', (6) 


while the ISD reads 


Stochastic Linear Shallow Water Waves 227 
V-a=a°V-(VtAVth’) 2 a? (V-h- V)V.Rh, (7) 
and we notice that aVh = 0. The mass equation then boils down to 
1 
(u—5V-a)-Vh+h¥ -u =0. (8a) 
In the same way, we get from a semi-martingale decomposition of (3a) that 


o,dB; Vu = 0, (8b) 


which yields 
1 L 
(u — F a) - Vu + fou^ + gVh — 0. (8c) 


At this point it is interesting to interpret the random streamfunction in terms of 
nondimensional units to infer the importance of the Itó-Stokes drift (ISD) in compar- 
ison to geostrophic flows. To that end, we assume that dB; ~ /eT(gH/fo)VT, 
where T denotes the characteristic time scale and €T stands for a small-scale 
decorrelation time with e the strength of uncertainty (the greater e the stronger 
the noise is). From definition (3c), the noise's quadratic variation scales then as 
a ~ eT (gH)! /( foL)? with L the characteristic length scale. As a consequence, the 
ratio of the ISD advection term with the geostrophic gradient pressure scales as 


V-a)-V H L2 
(V -a) ME LE CN (9) 
gVh fL L 


where La = ./gH/fo is the Rossby deformation radius, and By denotes the Burger 
number, which stands for the ratio between vertical density stratification and the 
earth’s rotation in the horizontal (Bu = (NH/QL)*? = (L4/L)). If the ISD 
advection has the same importance as the gradient pressure term, then we must 
have ye = L/L4. This means that when the scale of motions is greater than the 
deformation radius, the rotation will dominate and the noise must be important 
to have the ISD playing a role. At the opposite, when the scale of motions is 
smaller than the Rossby radius, the small-scale flow component does not need to 
be important to be as significant as the gradient pressure term. From this point of 
view, the deformation radius can be interpreted as the limiting scale under which 
the statistically modified advection due to the inhomogeneity of the small-scale 
component plays a role. 

In the following, we will assume to be at a scale much larger than the deformation 
radius, so that the action of the ISD becomes negligible. In that case, the stationary 
system finally simplifies as 


(u-V)u+ fout + gVh = V - (uh) =0. (10) 
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This system is structurally the same as the nonlinear stationary system, at the 
exception of a scaling constraint on the ISD, which makes negligible the nonlinear 
advection term. As a matter of fact, noticing that foru « Vth, the advection 
term corresponds to the ISD (7), it follows that the geostrophic balance, u = 
—(g/ fo) V th, is a stationary solution of such system for a null ISD. We show below 
it is the only non-trivial stationary solution of such system. 

Let us write the velocity as a superposition of the geostrophic component and an 
ageostrophic component: 


u =- = Vl v, (11) 
fo 


where v is defined through the Helmholtz decomposition from a potential function, 
@, and a stream function W that both depend on the surface elevation: 


v = Vi (h) + Vo (h), (12) 


— V-hW'(h) -- Vh o'(h). (13) 


As Vth belongs to the null space of the velocity gradient (8b), Vh either belongs 
also to the null space or it is an eigenvector of the velocity gradient tensor (Vu). 
From the momentum equation (10) we have 


(Vho'(h))-Vv = — fv. 


Then if Vh belongs to the null space of the velocity gradient, v directly cancels out. 
If is an eigenvector of the velocity gradient with eigenvalue A, the above equation 
reads 


AVhó'(h) = — f (VhW'(h) + V-hà'(h)), 


which implies ®’(h) = V'(h) = 0 and hence v = 0. 

Physically, we see hence that the considered nonlinear system admits a stationary 
solution for a negligible ISD. Let us note that no such constraint is available in 
the deterministic setting. In the following we will interpret this stationary solution 
in the linearized stochastic shallow water system in terms of wave solutions and 
geostrophic adjustment. 


4 Stochastic Rotating Shallow Water Waves 


In order to look at the different wave solutions associated to the stochastic shallow 
water system (2), we proceed in the same way as in the deterministic case [19, 28]. In 
particular, we assume that the noise’s structure in (1b) is independent of the resolved 
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prognostic variable u. In that case, the associated linearized system of (2) reads 


1 
dju + (fou + Vn — 2" -(aVu)) dt+o0,dB,-Vu = 0, (14a) 
1 
din + (uv di 5V -(aVn)) dt + o;dB, - Vj = 0, (14b) 
V-0,;dB;=V-a=0, (14c) 
1 2 2 = 
d, 2^ tul + gl + 5)*) dx — 0. (14d) 


It can be checked that this system conserves the total energy (14d) in the same way 
as the initial nonlinear stochastic system (2) does. For noise defined through Hilbert- 
Schmidt correlation tensor this system admits a mild solution [22]. Existence of 
strong solution could also be shown from the nonlinear system [16]. In order to build 
some simple analytical solutions of this linearized system and to better understand 
the physical behaviours of the resulting waves, only very specific noise models will 
be considered in the following. We first build the ensemble-mean wave solutions 
under homogeneous noise, then investigate the path-wise solutions under constant 
noise and under homogeneous noise but with very smooth structures. 


4.1  Ensemble-Mean Waves Under Homogeneous Noise 


Let us first recall the definition of the homogeneous and incompressible noise: 


c, (x)dB, = I, Vto(x — ydB;(y)dy 


= Y ik; (km) dB; (km) expks +x), (15a) 
a=) 9o kin (Kin) (15b) 


where i denotes the imaginary unit, km = [km, €m]’ € R? is the m-th wavenumber 
vector, e stands for Fourier transform (in space) coefficient and 8; € C are 
independent Brownian motions satisfying 8;(—k,,) = ßr(km) with Re(8;) and 
Im{;} be independent. This noise is homogeneous, and thus associated to a constant 
matrix a. Its ISD is null and fits naturally the condition on the stationary solution 
found in the previous section. 

For sake of simplicity, we assume hereafter that the noise is defined by only 


one Fourier mode (associated to a wavenumber k,) combined with its complex 
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conjugate, namely 
o,(x)dB, = Refika expliko -x)dB,}, a= le kEkL)", (16) 


where œ = @(k,) is assumed to be deterministic and real. This monochromatic 
noise can be directly extended to a multi-scale noise model (15a). Results with a 
multiscale version of the noise will be shown in the numerical section. 

Taking now the expectation (E) of the linearized random system (14), we have 


1 
à, E[u] + foE[u]" + gVE[n] — 5V  aVE[u] = 0. (17a) 


1 
dE[n] + HV -E[u] - 5V -(aVE[n]) = 0. (17b) 


where 9, denotes the partial time derivative. In order to infer the mean wave 
solutions, we look for a deterministic ansatz of the form 


[g] (x, t) = To exp (i(k -x — ot), (18) 


where q = [u, n]' = Re{q} is a compact notation for the prognostic variables of 
the RSW-LU (14), q is the initial constant vector (also assumed to be deterministic) 
and o is the time-frequency. We remark that E[g] = E[q] due to the commutativity 
of expectation with linear operators. 

Injecting next the previous ansatz together with the constant matrix a (16) into 


the system (17), we get Lo = 0 with 


1 0 —foigk 
pe (- io + laf ds x v?) L+| f 0 x22 s (19) 
iHkiHt 0 


where I3 stands for the 3 x 3 identity matrix, ko x k = ko £ — £o k denotes the cross 
product between the wavenumber vectors ko and k with |ks x k| = |k; ||k| sin(0) 
and 0 is the angle between them. 

As usual, the dispersion relations are then given by the solution of det(L) — 0, 
namely 


i i 
w= —5lel (ko xk), o= — 5 lel” ko xk)? + J gH|k? + fe. (20) 


We realize immediately that when the noise has the same direction as the initial 
wave (i.e. kọ x k = 0), these two frequencies correspond to the steady and Poincaré 
(inertia-gravity) waves of the deterministic system [19, 28]. Conversely, when the 
noise is not aligned to the initial wave, then the term — 5o]? (ko x k)? leads to 
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a damping of the ensemble mean wave. We next construct the mean plane wave 
solutions associated to the frequencies (20). 
4.1.4 Mean Poincaré Waves 


With the value of the last two (opposite) frequencies in (20) and the associated 
eigenvector of L (19), one obtains the following polarization relations: 


wk+i fol 
H\k|? ie " 
sja] = NT To exp (i(k - x — ot)) exp( — 5 lel (ko x k)'t). (21) 
1 


Taking the real part, we finally deduce the ensemble-mean wave solution: 


z[n] = fio cos (k - x — ot) exp( — leis x kt), (22a) 
. , k . kt 
[u] - spa] epa]. aa» 
TÁ 1 
ju] 0 = m cos (k +x — wt) exp ( — sla (ko x kt), (22c) 
T 1 
[us |Œ, = pe sin (k «x — ot) exp( — 5 e a x kt), (22d) 


where the component E[uj] is associated to the mean pressure waves that depends 
on surface elevation mean, whereas the latter component E[u] ] is associated to 
mean vorticity waves that are initiated by rotation. Note that in the short waves limit 
with |k||? >> 1/L?, the mean Poincaré wave corresponds to the inertia-gravity wave 


of a shallow water system without rotation. The damping term exp ( — Tol (ko x 
ky^t) depends on the noise's wavelength and variance. This term is zero when the 
noise and the wave are colinear (i.e. the angle 0 between ko and k is zero). For high 
noise magnitude (and 6 Æ 0), the damping occurs in a very fast way. In the long 
wave limit with ||k||? « 1/L?, the frequency approaches the Coriolis frequency 
and the damping term is much less important unless the noise amplitude is very 
high. Nevertheless, the exponential damping in time remains when the noise and 
the waves are not aligned. 


4.1.2 Mean Geostrophic Mode 


The polarization for the eigenvalue w = — 1 la |? (ks x k)? reads 
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—i Że 
N P | 1 
z[g] x. th= E d: To exp (ik -x) exp( — sla s x kt). (23) 
1 


The ensemble-mean of the wave solutions are given by 


" 1 
jn] = fono cos (k x) exp ( — 5e s x k)’t), (24a) 
z[u] = —giok~ sin (k -x) exp ( — laid x ky?t). (24b) 


This is a pure vorticity wave. When the noise and the wave are aligned, it 
corresponds to a steady wave in geostrophic balance, which is also, as we saw, a 
stationary solution of the nonlinear stochastic system associated to a divergence- 
free quadratic variation (10). We next look at the path-wise wave solutions under 
specific noise. 


4.2 Path-Wise Waves Under Constant Noise 


As an initial informative example, we first assume that the noise is constant in space 
using a zeroth order approximation of the Fourier mode exp(i ks + x) in (16). It can 
be expressed as 

c;dB, =Re{iakydB,}, a= o^ ki(kly. (25) 
In order to infer wave solutions, we look for stochastic ansatz of the form 


d (x, 1) = do exp (i (k -x — ot — Re(iyBj))), (26a) 


where y is of unit s~!/*. This ansatz has been shown to be a solution of a linear 
stochastic water waves for constant noise in [8]. Applying now the Itó formula [22] 
for the deterministic function (f, z) > To exp (i (c — ct — z)) composed with the 
random process Re(i y 8,), we deduce 


E l - 
dig = —((io + 5171?) dt + iRefivd,}) a, (26b) 


where the second term on the right-hand-side (RHS) comes from the quadratic 
variation of the random phase. 
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Injecting next the stochastic ansatz (26) as well as the noise definition (25) into 
the linearized system (14), we obtain a system composed of differentiable terms and 
Brownian (martingale) terms that can be compactly written as 


1 1 0 —foigk 
ee _ ; 2 2 2 : 
Lo dt = 0, bed io — zly? + 50° (ks x k) )b+ fo 0 igl), 
iHkiH£ 0 
(27a) 
— iRe[iydf,] + ik -Re{iak;dp,} = 0. (27b) 
The last equation leads to 
y=ak, x k € R. (28) 
Substituting it into (27a), we deduce 
—io —fo igk 
L=] fo —io igl |. (29a) 
iHk iHt —io 


We remark that in this case the linear operator L for the resolved variables reduces to 
the same as that of the deterministic system [19, 28]. Solving subsequently det(L) — 


0 gives us 
o=0, w=+/gH|k?+ få. (29b) 


We next construct the stochastic plane wave solutions associated to these frequen- 
cies. 
4.2.1 Stochastic Poincaré Waves 


With the value of the last two frequencies in (29b) and the associated eigenvector of 
L, one can find the following polarization relations: 


wk + ifoe 
Ake 
G(x.t) =| wl — ifok | fio exp (i (k -x — ot + yImif&))). (30) 
Ake 
1 


Taking the real part of this ansatz, we deduce the path-wise wave solution: 
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n(x, t) = jo cos (k - x — wt + yIm(fi]), (31a) 
u(x, t) Tow cos (k - x — wt + yIm{B ) 
"HK aie 
OT (k «x — ot + yin. (31b) 
H|k| |k| 


In this simple case, we can analytically compute the ensemble-mean from these 
path-wise wave solutions. Note that X, := k - x — ot + yIm(f;j) ~ N(k -x — 
wt, y?t), and the characteristic function of the Gaussian process X; is given by 
z[expG X;)] = exp (i (k -x — ot) — 1y?). One can then deduce the mean of the 
random ansatz (26), taking its real part leads finally to 


z[n] = To cos (k - x — ot) exp( — svt). (32a) 


zju] = ra (ok cos (k-x —ot) + fok sin (k-x—ot)) exp (- jv) (32b) 


It can be readily observed from the random dispersion relation (28), that we recover 
the general mean solution (22) presented in the previous section. 


4.2.2. Stochastic Geostrophic Mode 


The polarization for the eigenvalue w = 0 reads 


~ fo ~ 
G(x.t)=| i£k | Foexp (i(k xd yIm{6:))). (33) 
1 


The path-wise wave solutions are then given by 
n= fofo cos (k -x + ylm(,]), u = —giok~ sin (k -x + yIm(A]). — G4) 
Similarly, one can recover the general mean solution (24) by taking the expectation 


of these path-wise solutions. As in the previous case only the waves that are excited 
by the stochastic forcing remain active on long terms horizon. 
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4.3 Approximation of Path-Wise Waves Under Homogeneous 
Noise 


We now extend the previous solution to statistically homogeneous noise. In the same 
way as previously we will assume a monochromatic noise as defined in (16), but now 
slowly varying in space: 


o,dB, = Refiak; expiek; -x)d&;], a= Jo? ki (KZ, (35) 
where e < | is a small parameter to ensure the smooth structure of the noise. To 


infer wave solutions for such homogeneous noise, we now look for the following 
ansatz: 


q (x,t) = To exp (i (k “xX — ot — Re[iy exp(iek, - 2:2) (36a) 


which generalizes our previous ansatz to homogeneous noise. Applying the Itó 
formula for this ansatz, we have 


zi 1 = 
dig = - (liv + 5r) dt + iRe{iy exp(ieks -x)d6;}) q. (36b) 


Injecting these solutions ansatz into system (14), we can separate again the 
Brownian parts and the time-differentiable component. The former reads 


— iRe{iy exp(iek, - x)dB;} +ik- Re{iaks exp(iek, - x)dB;} = 0, (37a) 
which leads to 
y=ak, xk eR. (37b) 


In a similar way to the previous case, substituting this random dispersion into the 
diagonal component of the resolved linear operator satisfying Lg, dt = 0, we get 
diag(L) = —iwlI3. However, in order to compute the gradient terms of the anti- 
symmetric part of L, the random phase is linearized as 


q (x,t) ~ To exp (i(k “xX — ot — yRe{i(1 +iek, -x)6i})), 
= Gp exp (i ((k + eko yRe{f;}) «x — ot + yIm(f))). (38) 
E 
:=k; 


Hereafter, k, = [k, el" is referred to as the effective wavenumber vector ensuing 
from the space varying random phase. It can be noticed that the real component of 
the complex Brownian path influences the wave’s spatial phase, whereas the wave’s 
temporal phase is randomized by the imaginary component. This latter has already 
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been considered in the constant noise case. With the previous approximation, the 
linear operator L can be finally written as 


—iw — fo igk, 
L= fo. —iw igl, . (39a) 
i HK, i HÀ, —io 


The two resulting dispersion relations are now given by 


o=0, o-rtJsH|E[| + fè. (39b) 


The latter random dispersion relation reduces to the previous relation (29b) (asso- 
ciated to a spatially constant noise) when e — 0. We note that the homogeneous 
random noise leads to wave scattering. Such phenomena corresponds to similar 
results found in the setting of the Wentzel-Kramers-Brillouin approximation [15, 21] 
or more recently through Wigner transform [7, 13, 14]. The stochastic framework 
explored here leads nevertheless to simpler formal developments. 

In the same way as previously, we exhibit in the following the two types of 
stochastic waves associated to this spatially slowly varying homogeneous noise. 


4.3.1 Stochastic Poincaré Waves 
The path-wise wave solution in this case can be written as 


n(x, t) = To cos (K, +x — wt + yIm(fj)), (40a) 


u(x,t) = 


cos (k, - “x — ot 4 yIm(A]) A ra 
i 


sin (k, - +x — ot + yIm(fi) pa (40b) 


[k| 


It can be remarked that the surface elevation phase has two sources of randomness: 
a temporal one, in the modified frequency, and one in space coming from the 
space varying noise. For k = k, the solutions corresponds again to the classical 
deterministic waves, while, as shown in Sect. 4.1, when k # k,, the mean of 
the stochastic wave solutions (22) is damped compared to the deterministic surface 
elevation (ha (x, t)). 
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4.3.2. Stochastic Geostrophic Mode 


The path-wise wave solutions are given by 


n = foto cos (k, -x + yIm(&])), u= —gijok, sin (kı -x -ylm(&j]). (4D 


These solutions correspond to dispersive geostrophic modes wave packet. The 
velocity wave is a pure vorticity wave packet. Its ensemble mean is also damped 
for k Z ko and corresponds to the geostrophic stationary wave for k = ko (24). 

As a final word, on the general linear stochastic shallow water system, it can 
be noticed that in the long wave limit with |k|? « 1/L2, (when the frequency 
approaches the Coriolis frequency), the pressure gradient force becomes negligible 
compared to the other terms in (14a), and for a noise amplitude of order unity the 
linear shallow water system boils down to a linear stochastic transport equation in a 
rotating frame: 


1 
du + (fou — zy . (aVu)) dt + o,dB; $ Vu = 0. (42) 


Up to the diffusion and noise term (whose energy balances exactly) this closely 
corresponds to the so-called near inertial regime in which the fluid is purely 
transported. In the LU setting, the noise acts always as a random deviation whose 
energy is exactly compensated by the diffusion term. In the particular case of the 
shallow water model (linear or nonlinear) the total energy is conserved. For the 
linear system the total energy of the mean being damped up to a constant, the total 
energy variance increases up to a constant as a consequence of the total energy 
conservation. 


4.4 Numerical Illustrations 


We next give simple illustrations of the stochastic wave solutions. Here, rather than 
evaluating the analytical solutions presented in the previous sections, we propose 
to discretize numerically the linearized RSW-LU system (14) and perform Monte- 
Carlo simulations in order to estimate both path-wise and ensemble-mean wave 
solutions. To that end, the spectral (Fourier) method is adopted for the spatial 
discretization within a periodic domain, and an exponential integrator [8] combined 
with the Milstein scheme [9] is used to approximate the mild solution. This semi- 
discrete problem can be written as 


q = —ik - (q,0, AB), (43a) 


q® = —ik - (qo; AB;), (43b) 
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PN p — ]— 
Gitar = exp( AAT) (s. +q” + T) ; (43c) 
0 fo -—igk u 
A-| —fo 0 -—igt|. q-—|v|, (43d) 
-iHk-iHt 0 n 


where @ denotes the projection coefficient on the discrete Fourier modes, q = 
F! (q) is the inverse discrete Fourier transform of g, At and AB, stands for the 
timestep and the Brownian motion's increment respectively. We remark that the 
classical 2/3 dealiasing rule can be adopted for the practical computations of (43a) 
and (43b). 

A deterministic monochromatic wave corresponding to a single frequency of the 
Poincaré waves (propagating to the left) is fixed as the initial condition: 


ck-i fot 
ua z 
Go = | ATE | ak- ko), = eH|kl’ + f. (e) 
1 


where ô denotes the Dirac function and kọ is the initial wavenumber vector. 

For the simulation configuration, we consider a squared shallow basin of length 
L = 5120km and depth H = 100 m at mid-latitude (fọ = 107^ s7!) and a large- 
scale wave with k = [3Ak,0A€]’, where Ak = A£ = 2z/L. Each random 
system either under constant (25) or under homogeneous noises (16) has been 
simulated over 5 years with 100 realizations. We remark that here we do not use the 
smoothness approximation (35) in the homogeneous case. The noise’s amplitude is 
fixed asa = /Tg/fo, where t = Ax/./gH with g = 9.81ms~*, Ax = 40km 
and At = 5r. 

Figure | illustrates the evolution of the path-wise surface elevation 7 and of 
the ensemble-mean solution E[5] under a constant noise which has a different 
direction (with kọ = [4Ak, 6A£]") than that of the wave (ko). In this case, the path- 
wise solution preserves the magnitude of the initial monochromatic wave while the 
ensemble-mean wave is damped along time. 

Figure 2 demonstrates the results obtained with the homogeneous noise (with 
the same ko as in the previous case). In that case, the path-wise wave is dispersive 
(scattering effect) whereas the mean wave is dissipative. 

The solutions related to the homogeneous noise is rougher than for a constant 
noise. The damping associated to the mean solution is visually similar in both case. 

We then diagnosed an energy decomposition with respect to ensemble of 
different runs in Fig. 3. This decomposition consists of the (ensemble) mean of 
energy j[ Ip 5(H|u|?+gn7) dx], the energy of (ensemble) mean Io 5(Al ou] P+ 
gE[n]*) dx and the energy of “eddy” E[ fp (H|u — E[u]|? + g(n — E[n)?) dx]. 
Figure 3 shows that for both noise models, the energy of mean waves is quickly 
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Fig. 1 Illustration for the path-wise surface elevation (top) and its ensemble-mean (bottom) with 
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Fig. 2 Illustration for the path-wise surface elevation (top) and its ensemble-mean (bottom) with 
an homogeneous noise (with ko x k 4 0) at different time (by columns) 


dissipates along time while the energy of eddy waves increases with the same 
rate and continuously backscatters the variance to the ensemble. This mechanism 
ensures that the mean of the random energy is preserved in time. This corresponds 
well to the characteristic (2d), (14d) of the proposed stochastic transport model. 
Figure 3 illustrates also the dissipation rate of the energy of the mean waves in 
terms of different scales of the noise (equivalently, different angles of directions 
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Fig. 3 (Top) Time evolution of energy decomposition (w.r.t. ensemble) with constant (left) and 
homogeneous (right) noises; (Bottom) Comparison of dissipation of ensemble mean for different 
noise’s scales 
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Fig. 4 Illustrations under multi-scale noise. From left to right: pathwise wave solution after 5 years 
simulation, the corresponding ensemble-mean and evolution of energy decomposition 


between the noise and the wave). The numerical results confirm our analyses in 
Sect. 4: the larger the angle 6, or the smaller the noise's scale ks, the faster the 
mean waves are damped in both cases. 

Furthermore, propagation of the monochromatic wave by a multi-scale noise 
model have been numerically tested. In particular, we consider a band of wavenum- 
bers for the noise, kg = {kj}j=m,...m With |km| < |kj| < |ku|, Vm < j < M, 
satisfying the —3 spectrum power law: o?|k |? = aZ Ikt]? (Ik ; |/Iks1) 3, i.e. oj = 
Om (|k j|/lKm|)~>/?. Figure 4 shows the results with km = [5Ak, 5A£]", o = o (as 
mentioned above) and 10 wavenumbers in total with uniform step Ak. After 5 years 
of simulation, the pathwise wave is dispersive while the mean wave is dissipative, 
and both of them are more irregular than the monochromatic noise solutions (see 
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Fig. 2). We obtain also the consistent conclusion for the ensemble decomposition of 
the total energy. Moreover, the conversion from energy of mean to energy of eddy 
in this case is more faster and efficient than that of the monochromatic noise model 
(see Fig. 3). 


5 Shallow Water PV Dynamics and Geostrophic Adjustment 


The geostrophic adjustment is the process by which the flow and the pressure 
field tend to mutually adjust at large scale under the influence of earth’s rotation. 
An obvious manifestation of this adjustment corresponds to isobaric wind field 
and geostrophic current in the ocean. In the classical case of the deterministic 
linear shallow water model the geostrophic adjustment can be explained in terms 
of a variational formulation as the state of minimum energy corresponding to a 
given constant value of the potential vorticity (PV). Within the LU modelling, the 
geostrophic adjustment can be explained from the nonlinear system. To that end, let 
us derive the PV equation of the stochastic system (2). Taking first the curl of the 
momentum equation (2a) under the space-time invariant Coriolis parameter fo, we 
have 


DiE + fo) = —& + fo)V -udt + [V^u, Vo;dB,], 


1 
us 3 Ox, dx, (Vaij x u) dt, (45a) 


i,j-1,2 
where £ = V xu = 0yv—0,u denotes the relative vorticity and [A, B]; = tr(A’ B) 
stands for the Frobenius inner product of two matrices A and B. Applying next the 
chain rule [24] for the stochastic transport of height (2b), one obtain 

D,h-! 2 n-!v .udt. (45b) 


Let us recall the product rules of two stochastic transport equations in the following. 
In particular, if two arbitrary tracers 0 and ¢ satisfy 


D,0— Odi, Dt —Zdt4 XdB,, (46a) 


where O, Z are time-differentiable forcing terms and X a martingale forcing 
component, then according to the Itó-integration-by-part formula [22], we have 


t t 
D; (6c) = 6D,¢ + tD,6 — a f o,dB, vo, [ ZàB,), (46b) 
0 0 
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where the last bracket term denotes the quadratic covariation of two martingales 
[22]. Applying such rule for 0 = ^-^! and ¢ = & + fo associated with (45), one 
deduces the stochastic evolution of the PV, q = (& + fo)/ h, namely 


D;q = h^! [V^u, Vo;dB;], + "m XO 99. (Vaij x u) dt 


i,j=1,2 
t t 

- a f dB, vit, f [V+u, Vo,dB,],) . (47a) 
0 0 t 


Opposite to the deterministic shallow water case the PV is not transported by the 
stochastic flow in general and some source/sink terms appear on the right-hand 
side of this PV equation. These source/sink terms reflect here the action of the 
small-scale on the non conservation of PV. In the deterministic context, PV is very 
sensitive to turbulence and subgrid modelling [3]. The same mechanism is at play 
here. We can nevertheless explore the condition for which PV remains conserved in 
the setting of a stochastic modeling of the small scales effect. The first and last terms 
cancel if the large-scale velocity field and the small-scale random component align 
with each other up to a uniform vector field. The second term trivially cancels if the 
random field is homogeneous in space (as in that case a becomes a constant matrix). 
With these two previous conditions (alignment and homogeneity) PV is path-wise 
conserved. For homogeneous (incompressible) noise the expectation can be written 
in flux form and the mean PV is globally conserved. For homogeneous field, the PV 
equation reduces then to 


id =h"'[Vtu, Ve;dB;], — d f 


o,dB, SENI 
0 


l [V^u, Vo;dB,],) . 


(47b) 


The above equation, combined with the previous results on the stochastic linear 
dynamics, enables revisiting the mechanism of geostrophic adjustment in the 
presence of a forcing. The corresponding global pictures is as follows. Let us 
first consider random fluctuations in the ocean generated by wind forcing at large 
scales. Then, due to the wavelength mechanism described in the previous section, 
all the waves that are not aligned with this forcing are smoothed out and eventually 
annihilated following an exponential decay. The only waves that eventually remain 
are aligned with the wind forcing, which at large scale corresponds essentially to 
near inertial waves, and the PV dynamics corresponds then to a pure transport. 
Thus, the system tends then to relax to a balanced state as in the deterministic 
case. Conversely, we can devise a similar picture in the atmospheric context, in a 
configuration where the ocean plays the role of a noise for the atmosphere. The 
atmospheric waves will evolve toward a near inertial wave field aligned with the 
noise, following the exact same process. As a result, merging these two perspectives 
provides an interesting ocean/atmosphere coupling mechanisms of auto adjustment. 
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At small scales, things are more complicated as the ISD has to be taken into account 
with an isotropization process that is likely less obvious. As a result the forcing 
terms in the PV equation remain. Furthermore, in that case the smooth spatial 
structure assumption of the noise imposes a very low noise amplitude. For the study 
of strong small-scale forcing this assumption as well as the wave ansatz associated 
to it have to be revisited. This will be the objective of future works. 


6 Conclusions 


This stochastic extension of the shallow water equations highlights the behavior 
of the LU setting for large-scale representation of flow dynamics. Opposite to 
classical eddy viscosity models, which introduces a similar damping term on the 
waves form, here, the waves that are sustained by the noise term keep their full 
classical expressions. This provides a simple mechanism for waves selection and 
an interesting simplified model explaining the emergence of near inertial waves in 
ocean atmosphere systems. In the LU representation of the linearized shallow water 
system, the ensemble-mean waves that are not excited by a noise term with the same 
wavelength vanish exponentially fast whereas the others correspond to the classical 
deterministic wave solutions. The decay rate depends on the noise variance and on 
the wavenumber (in a quadratic way). The vanishing is therefore all the more fast 
for waves with small wavelength. The noise acts hence as a Dirac comb on the mean 
wave field. 
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Said Ouala, Bertrand Chapron, Fabrice Collard, Lucile Gaultier, 
and Ronan Fablet 


Abstract Sea surface temperature (SST) is a critical factor in the global climate 
system and plays a key role in many marine processes. Understanding the variability 
of SST is therefore important for a range of applications, including weather and 
climate prediction, ocean circulation modeling, and marine resource management. 
In this study, we use machine learning techniques to analyze SST anomaly (SSTA) 
data from the Mediterranean Sea over a period of 33 years. The objective is to best 
explain the temporal variability of the SSTA extremes. These extremes are revealed 
to be well explained through a non-linear interaction between multi-scale processes. 
The results contribute to better unveil factors influencing SSTA extremes, and the 
development of more accurate prediction models. 


Keywords Sea surface temperature - Machine learning - Stochastic models - 
Extremes 


1 Introduction 


Sea surface temperature (SST) is a critical parameter in the global climate system 
[1, 2] and plays a vital role in many marine processes, including ocean circulation, 
evaporation, and the exchange of heat and moisture between the ocean and 
atmosphere [3, 4, 5]. 

In recent years, particular attention has been attracted by marine heat waves, 
when SST largely exceeds the local expected average values [6, 7]. Extreme SST 
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can cause coral bleaching [8, 9], with cascading effects on the entire ecosystem. 
Additionally, localized events affect the amount of atmospheric moisture available, 
to impact precipitation patterns and the likelihood of drought or flooding in certain 
regions [10]. Better uncovering factors contributing to these extreme events is 
therefore of great importance to help predicting and mitigating their impacts. 

The SST dynamics compound many processes that interact across a continuum 
of spatio-temporal scales. A first-order approximation of such a system was 
initially introduced by [11, 12]. Hasselmann pioneered a two-scale stochastic 
decomposition, to represent the interactions between slow and fast variables. In this 
study, we focus on SSTA data collected in the Mediterranean Sea, and examine 
the potential of machine learning techniques to derive relevant dynamical models. 
Focus is given on the seasonal modulation of the SSTA and we wish to unveil factors 
influencing the temporal variability of SSTA extremes. The proposed analysis builds 
on Hasselmann’s assumption that the variability of the SSTA can be decomposed 
into slow and fast components. The slow variables mostly follows the seasonal 
cycle, while the fast variables are linked to rapid processes, e.g. the wind variability. 
We thus approximate the probability density function of the SSTA data, using a 
stochastic differential equation in which the drift function represents the seasonal 
cycle and the diffusion function represents the envelope of the fast SSTA response. 

The paper is organized as follows. We start by introducing the general under- 
lying state space model of the SST anomaly. Rather than directly presenting 
the stochastic model, we first assume that an underlying deterministic ordinary 
differential equation (ODE) can represent the non-periodic variability of the SSTA. 
Considering a phase space reconstruction setting, we use the neural embedding of 
dynamical systems (NbedDyn) framework [13, 14] for this task. We then discuss the 
limitations of such a representation, and present the stochastic model. We conclude 
by summarizing our findings and potential future directions. 


2 Method 


Let us assume the following state-space model 


zt = f(z) () 
x; = H(t) (2) 


where t € [0, +00] is time. The variables z; € IR? and x, € R” represent the 
state variables and the SST anomaly observations respectively. f and H; are the 
dynamical and observation operators. The impact of noise on the dynamics and 
observation models is omitted for simplicity of the presentation. 
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2.1 Deterministic Model Hypothesis 


The NbedDyn Framework If we assume that z is asymptotic to a limit-set L C 
IR* and that the observations model is not an embedding [15], The NbedDyn model 
allows one to jointly derive a geometric reconstruction of the unseen phase space 
from partial observations and a corresponding dynamical model. For any given 
operator H of a deterministic dynamical system, Takens theorem [16] guarantees 
that such an augmented space exists. However, instead of using a delay embedding, 
NbedDyn defines a dg-dimensional augmented space with states u, € R? as 
follows: 


T TaT 
u =[x.y;] (3) 
where y, € R“#~" are stated as the latent states and T represents the matrix 


transpose. They account for the unobserved components of the true state z,. 


The augmented state u; is assumed to satisfy the following state space model: 


ù; = fo, (uq) (4) 
x, = (Guy) (5) 


where G is a projection matrix that satisfies x; = Guy. The dynamical operator fa, 
belongs to a given family of neural network operators parameterized by a parameter 
vector 6). In this work, we follow [14] and use a linear quadratic parameterization 
of fo,. This particular parameterization allows us to guarantee boundedness of the 
ODE (4) using the Schlegel boundedness theorem [17]. A linear quadratic ODE 
model can be written as follows: 


ù = fo, (ug) = € + Lu, + [u/ Qu, ...,u Qu]? (6) 
where c € R, L e R¢E*4E and Q = [usd ast = 1,...,dg. The above 
approximate model is shifted according to uy = u; — m with m € RE. The 


approximate dynamical equation of the shifted state can be written as: 

a, = d + Aa, + [u QU? u,, ..., uT Q2 a," (7) 
with 

d = c + Lm + [m QU m, ..., m^ QU? m]? (8) 


and 
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A= o = (i + » ijk + aama) (9) 


k=1 


Given an observation time series of size N + 1 {Xi,..., Xt}, the training setting 


comes to jointly learning the model parameters 01 = (c, L, Qt, Q? 4 otk, m} 
and the latent states y, according to the following constrained optimization problem 


N 
01, dm = arg PEE } »» IX; > G9», ;, (us) I 
UH j= 


+ Alu, — 96, ;(u; II? (10) 

T holy 

+ à3C2 
with $g,;(uj.j) = W- + Sii fo, Uw)dw is the flow of the ODE (6) 
(in our work, this flow is approximated using a Runge Kutta 4 scheme) 
and C = D jlldijk + dikj + bjik + bjki + bkij + bk jil? and 
C2 = = Max(a;, 0)/Max(a; + 1,0) where oj, i = 1,...,dg¢ the eigenvalues 
of the matrix A, = (A + AT). The variables A1,2,3 are constant weighting 


parameters. The first constraint Cı steams from the energy-preserving condition of 
the quadratic non-linearity. It forces the contribution of the quadratic terms of fo, 
to the fluctuation energy to sum up to zero. The second constraint, C2, ensures that 
the eigenvalues of A; are negative. Satisfying these constraints guarantees that the 
model fg, is bounded through the existence of a monotonically attracting trapping 
region that includes the limit-set revealed by the minimization of the forecasting 
loss. Similarly to the Takens delay embedding technique, the sequence: 


Rosy = (0), = [x], 97 ] with t; = 0, ..., ty} (11) 


represents a geometric reconstruction of the phase space. In addition to this 
reconstruction, the NbedDyn model can be used to forecast new observations by 
determining an initial condition of the unobserved component y, and performing 
a numerical integration of the ODE model (6). We infer the initial condition using 
a minimization of an objective function similar to (10), but only with respect to 
the latent states y,. This minimization can be seen as a variational data assimilation 
problem, with partial observations of the state-space variables and known dynamical 
and observation models [18]. 


Related Works Related state-of-the-art techniques mainly rely on the reconstruc- 
tion of a phase space using delay embedding [16]. This includes both traditional 
parametric and non-parametric modeling techniques [19, 20] as well as recurrent 
neural networks (RNNs). The latter family of methods includes both simple RNN 
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parameterizations of dynamical systems, as well as latent space inference techniques 
that are built on an approximation of a posterior distribution that requires the 
parameterization of a delay embedding [21, 22, 23]. 


The interest of the NbedDyn framework in contrast to delay embedding based 
approaches resides in the fact that we do not exploit either a delay embedding or an 
explicit modeling of the inference model (i.e., the reconstruction of the latent states 
given the observed time series). As such, our scheme only involves the selection 
of the class of ODEs of interest. This model reduces the complexity of the overall 
scheme to the complexity of the ODE representation and guarantees the consistency 
of the reconstructed latent states w.r.t. the learnt ODE. 


2.2 Stochastic Model Hypothesis: The Stochastic NbedDyn 


When using phase space reconstruction techniques, one should not forget about 
the assumptions that this theory is built on. For any embedding to work, we are 
assuming that the dynamical model in (1) exists and can be represented by an 
ordinary differential equation [15]. For several realistic applications, this ODE may 
not exist or can have an extremely large dimension. In geoscience, for instance, 
the dimension of a state space variable can reach s ^ O (10?). In these situations, 
reconstructing such an high-dimensional phase space becomes significantly more 
challenging. In practice, the model returned by any embedding technique can be 
complemented by an appropriate closure. The form of this closure term can be 
deterministic using for example the framework of [24] or stochastic through an 
appropriate calibration of a noise forcing. 

When considering SST anomaly data, after calibration of the neural embedding 
model, an unpredictable, high frequency residual remains. Based on Hasselmann's 
idea, we assume this residual component represents the effect of fast-scale pro- 
cesses, e.g. passages of atmospheric and oceanic eddies. To first order, it can be 
represented as a modulated white noise. Indeed, this residual, shown in Fig. 3, 
exhibits correlations with the slow-scale SST anomaly data. 

To model stochastic SST anomalies, the deterministic NbedDyn model described 
above is first optimized, and further complemented (6) with a stochastic forcing as 
follows: 


I = fo, (ui) + go, (uj), (12) 


X; = Gu; 


with &, is a white noise. We derive the parameters of the model (12), as follows. 
Given an observation time series of size N + 1 {Xp,..., Xr}, similarly to the 
deterministic case, we optimize the diffusion parameters 02 to minimize the forecast 
of the observations. In addition to the diffusion parameters, we also reconstruct a 
noise realization £'*^ that generates the observations process under (12). Overall, 
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the optimization problem can be written as follows: 


T 
^ arec i=N—1 7 " 2 
9E. P5 = arg min ) IIx; — Go, (ui, &7*5) | 
á t=1 


(13) 
u; = $,,(u;-1, 6") 
Subject to 1 Gu; = Xx; 
Rgrecgree(t) = 0 for all c #0 


. Zrléc {= — . . . . . . . . . . 
with {&,, p ' is the noise realization that minimizes the objective function 
in (13) and $5» z: 


t t 
Po (uj, .,, £8 ^^) = wy + j fo (u,)dw + f go (u,)& dw 
t—1 t—1 


the solution of the stochastic model. This solution is approximated in this work 
using an Euler-Maruyama scheme, which makes the model converge to an Ito SDE. 
In practice, we use the following regularized optimization problem: 


T 
ôs, 8^ = arg 2. $ lx — Gg, (ui. er 


2 t=1 
+ A4C3 (14) 
+ AsCa 
+ AeCs 


with C3 = ||Rgreegree(t)||?, C4 = Var(o,1(ur—1, £'""P)), C5 = ||Bo, 0-1, £'*^) — 
E[9» , (u;..;, &'^"?)]||? and £^"? is a sampled Gaussian white noise. The variables 
À4,5,6 are constant weighting parameters. The first constraint C4 makes the recon- 
structed noise path white. The second and third constraints, C4,5, ensure that the 
SDE generalizes to sampled white noises. Specifically C4 makes an ensemble of 
trajectories generated from sampled white noise close to the trajectory generated 
from the reconstructed noise and C» reduces the spread of the ensemble around the 
trajectory simulated from the reconstructed noise. 

After optimization, we can couple the optimization problems (10) and (14) and 
calibrate jointly all the model parameters 01, 05, y,, £'*^. This fine tuning step is not 
essential but allows both the drift and diffusion parameters of the model to adapt to 
each other. 
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3 Numerical Experiments 


3.1 Data 


Sea Surface Temperature Anomalies (SSTA) in the Mediterranean Sea correspond 
to the Ligurian Sea at 8.6°E, 43.8°N. The anomalies are computed based on a yearly 
average of the annual 99th percentile of the SST reanalysis [25, 26]. The time series 
is made up of daily SST anomaly measurements from 1987 to 2019. We use the daily 
data from 1987 to 2014 as training data. Figure 3e illustrates the time series. These 


time series include a seasonal cycle and non-periodic high temperature extremes in 
the summer. 


3.2 Analysis of the Deterministic Model 


In this first experiment, we investigate if the deterministic neural embedding model 
is able to model the non-periodic variability of the SSTA extremes. For this purpose, 
we test 3 models with dimensions of the embedding ranging from 1 to 10. 


Analysis of the Embeddings The choice of the dimension dg is linked to the 
number of independent variables that can be used to model the dynamics using, in 
our context, a bounded autonomous linear quadratic ODE. We start by studying the 
direct impact of dg on the performance of the NbedDyn model. Figure 1 shows 
the impact of dg on the training error between the observations and the model 
simulation. Other criteria could be used (please refer to [13, 14] for a more in 
depth analysis of this parameters on other case studies), but overall, the training 
error provides a direct measure of the effectiveness of the embedding dimension 

in the training phase. The first evaluation of the training error reported in Fig. 1 


Fig. 1 Mean training error at 
convergence. We report the 
mean training error at 
convergence of the nn 
deterministic NbedDyn 
model for different 
dimensions dg of the 
embedding. This error is 
averaged over the training 
time series, and we highlight 
here both the mean and 
standard deviations 
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corresponds to dg equal to the dimension of the measurements, i.e. dg = 1. 
In this experiment, no latent states y, are used and the embedding u; = x;. In 
such situations, the ODE model can not perfectly fit the data. Furthermore, at this 
particular value of dg, the models are more likely to display a bad asymptotic 
behavior. As the dimension increases, this training error decreases which confirms 
better modeling abilities using the NbedDyn model. In the following, we study the 
models with dg = 3, 6 and 10. 


Asymptotic Properties of the Models We evaluate the asymptotic behavior of the 
deterministic models for dg — 3, 6 and 10. For this purpose, we run the nbedDyn 
models for a period of 27 years. The resulting simulation is visualized with respect 
to the reconstructed phase space (11) of the training data in Fig. 2. Overall, the 


(a) (b) 


——— Model simulation 
Reconstructed phase space 


Yo — 


Fig. 2 Asymptotic solution of the deterministic models. We visualize the simulation of the 
deterministic models with respect to the reconstructed phase space. The models with dg — 3,6 
and 10 are given in figure (a), (b) and (c) respectively. For dg > 3, we project the simulation and 
the reconstructed phase space into R? 
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models are only able to reproduce the seasonal cycle of the SST anomaly data. Other 
experiments (not shown here) suggest that even a farther increase of the dimension 
of the embedding does not allow the model to capture the non-periodic behavior of 
the SST anomaly extremes. 


Analysis of the Training Residuals To further investigate the asymptotic behav- 
ior of the deterministic models, we visualize in Fig. 3 the training residual 
(xi; — GBo r (uj ,) with t; = to, ..., ty]. When the dimension of the embedding 
increases above dg — 1, a qualitative and quantitative change in the residual error 
is present. This is due to the fact that a two-dimensional ODE (in IR) is needed 
to capture the oscillations of the seasonal cycle of the SSTA. However, when the 
dimension of the embedding increases above 2, no clear qualitative or quantitative 
change is present. Furthermore, the residual is much more high frequency than 
the training SSTA data, which suggests that the errors are due to a missing high 
frequency scales that can not be modeled using the standard deterministic model. 


Based on these considerations, and motivated by Hasselmann's works on stochas- 
tic climate models with applications on SST anomaly data, we proposed the 
stochastic NbedDyn model. In this framework, the SST residual of Fig. 3 is modeled 
as a stochastic forcing. 


3.3 Analysis of the Stochastic Model 


We focus our analysis on the model with dg — 6. We add a stochastic forcing to the 
neural embedding model (the parameters of the diffusion function ge, are optimized 
according to Appendix 1). Figure 4 shows the reconstructed phase space under 
this new model, as well as a model simulation of 27 years. When compared to the 
simulations of the deterministic model in Fig. 2, the stochastic model is able to cover 
the whole reconstructed phase space, including the regions with high temperature 
extremes. This shows that including the high frequency forcing is crucial for the 
model to capture the non-periodic behavior of the extremes. 

These observations are further illustrated in the simulation example given in 
Fig. 5. The stochastic model is able to produce an ensemble of SST anomaly 
trajectories that reproduce the non-periodic variability of the extremes. Furthermore, 
the trajectories generated from a sampled white noise match the one of the 
reconstructed noise, which validates the proposed training procedure. 

We can also discuss the marginal PDF of the stochastic model and compare it to 
the one computed from the data in Fig. 6. The PDF of the model is computed over a 
simulation of 109 years. Overall, the model is able to correctly model the high SST 
anomalies (in the summer), including the non-periodic extremes that form the tail of 
the distribution. The negative SST anomalies (in the winter) are not approximated 
as good as in the summer case. This is due to the fact that the model flattens the 
PDF in the winter by generating trajectories that have more spread (as highlighted 
in the ensemble prediction experiment in Fig. 5). We did not investigate this problem 
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Fig. 3 Training residual and corresponding training data. We visualize the training residual for 


dg = 1 in (a), dg = 3 in (b), dg = 6 in (c) and dg = 10 in (d). Corresponding training data are 
given in (e) 


Analysis of Sea Surface Temperature Variability Using Machine Learning 257 


Fig. 4 Simulation of the 
stochastic model. We 
visualize a stochastic model 
simulation and compare it to 
the reconstructed phase space. 
Both the model simulation 
and reconstructions are 
projected into IR? 
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Fig. 5 Ensemble simulation of the stochastic model. We visualize an ensemble simulation of 
the stochastic model, both in the training (a) and test (b) sets. The simulation in the training 
set is carried to compare trajectories computed from a sampled noise to the one issued from the 


reconstructed noise £"** 


Fig. 6 PDF of the data and 
the stochastic model. We 
compare the marginal PDF of 
the stochastic model with the 
one of the data 


—— — Data PDF 
——- Model PDF 


within the present study. However, we can make the PDF sharper in the winter by 
forcing the diffusion of the model to be closer to zero during this season. 
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4 Conclusion 


In this work, we examined the potential of machine learning techniques to derive 
relevant dynamical models of sea surface temperature anomaly data in the Mediter- 
ranean Sea. We focused on the seasonal modulation of SSTA extremes and used 
a neural embedding model to reconstruct the phase space of SSTA data. We then 
added a stochastic forcing term to account for the missing high frequency variability. 
Our results contribute to the understanding of the factors influencing SSTA extremes 
and the development of more accurate prediction models. In particular, the analysis 
highlights the importance of including these fast high-frequency scales in the 
modeling of SSTA data. 

One potential avenue for future work is to investigate the white noise hypothesis 
in comparison to other types of stochastic models, such as those based on colored 
noise or fractional Brownian motion. Furthermore, it would be interesting to apply 
the methodology to other regions and compare the results to evaluate the local 
impacts of the fast-scales on the slower ones. The ability of using this model as 
an emulator and studying its predictive skills with respect to standard ocean data 
assimilation based systems is also a promising perspective. 

Finally, and from a methodological point of view, this work highlights the impor- 
tance of complementing models that are returned by an embedding methodology. 
Specifically, and as discussed in Sect. 2.2, in complex applications such as the ones 
in geosciences, the dimension of the underlying state variables is likely to be huge 
and defining ways of complementing reduced order models through appropriate 
closure terms is mandatory in order to capture the variability of the data. Analysing 
the residual of the model fitting procedure is a natural way to define and optimize 
this closure terms. 


Appendix 1: Training 


The trainable parameters of the deterministic NbedDyn models i.e. the linear 
quadratic ODE and initial conditions of the latent states are initially sampled from a 
uniform distribution. The training of all models is carried using the Adam optimizer. 
We use a varying learning rate (from 0.1 to 0.001) in all the experiments to speed 
up the training. Regarding the weighting parameters (A; ie. we set A; = 1 during 
all the training. The weights responsible for the boundedness constraints were set at 
higher values in the beginning of the training i.e. A2 = 100 and A3 = 1000 and 
then reduced to Az = 1 when A3C2 = O. The training is stopped using cross- 
validation. Regarding the stochastic forcing, the parameters of the diffusion are 
initially sampled from a uniform distribution and the noise path "°° is initialized 
from a standard normal distribution. We use a learning rate of 0.001 and the 
weighting parameters {A; =? to one during all the training. We finished the training 
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with a fine tuning step, in which all the parameters of the model are optimized jointly 
with a learning rate of 0.0001. 


Appendix 2: Parameterization of the Diffusion Function 


The diffusion function ge, : RE —+ R4E*4N where dy is the dimension of the 
noise. In our experiment, we parameterized this function using a fully connected 
neural networks with 2 hidden layers with a sigmoid activation and 400 neurones 
per hidden layer. The dimension of the noise dy is set to the dimension of the state 
uz. 
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Data Assimilation: A Dynamic A 
Homotopy-Based Coupling Approach ems | 


Sebastian Reich 


Abstract Homotopy approaches to Bayesian inference have found wide- spread 
use especially if the Kullback-Leibler divergence between the prior and the 
posterior distribution is large. Here we extend one of these homotopy approaches 
to include an underlying stochastic diffusion process. The underlying mathematical 
problem is closely related to the Schródinger bridge problem for given marginal 
distributions. We demonstrate that the proposed homotopy approach provides a 
computationally tractable approximation to the underlying bridge problem. In 
particular, our implementation builds upon the widely used ensemble Kalman filter 
methodology and extends it to Schródinger bridge problems within the context of 
sequential data assimilation. 


1 Introduction 


Sequential data assimilation interlaces dynamic processes with intermittent partial 
state observations in order to provide reliable state estimates and their uncertainties. 
A wide array of numerical methods have been proposed to tackle this problem 
computationally. Popular methods include sequential Monte Carlo, variational infer- 
ence, and various ensemble Kalman filter formulations [5, 8]. These methods can 
encounter difficulties whenever the predictive distribution is incompatible with the 
incoming data; in other words whenever the distance between the prior, as provided 
by the underlying stochastic process, and the data informed posterior distribution is 
large. It has long been realized that this challenge can be partially circumvented 
by altering the underlying stochastic process through appropriate control terms 
or modified proposal densities [5, 23, 16, 9]. Recently, the connection between 
devising such control terms and Schródinger bridge problems [4] has been made 
explicit [16]. However, Schródinger bridge problems are notoriously difficult to 
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solve numerically. The key contribution of this paper is to provide a computationally 
tractable (sub-optimal) solution via a novel extension of established homotopy 
approaches [7, 14]. Similar to related homotopy approaches for purely Bayesian 
inference, the solution of certain partial differential equations (PDE) is required 
in order to find the desired control terms [14, 25]. In line with standard ensemble 
Kalman filter (EnKF) methodologies we approximate these PDEs via a constant 
gain approximation [21]. There are also alternative approaches to sequential data 
assimilation or inference which utilize ideas from optimal transportation; see for 
example [15, 6, 20, 3]. 

The paper is structured as follows. The mathematical formulation of the data 
assimilation problem, as considered in this paper, is laid out in Sect. 2. The standard 
optimal control and Schrédinger bridge approach to data assimilation is briefly 
summarized in Sect. 3, and the novel control formulation based on an homotopy 
formulation is introduced in Sect. 4. A practical implementation based on the 
EnKF methodology is proposed in Sect. 5. A series of increasingly complex data 
assimilation problems is considered in Sect. 6 in order to demonstrate the feasibility 
of the proposed methodologies. The paper concludes with some conclusions in 
Sect. 7. Detailed mathematical derivations can be found in Appendices 1 and 2, 
respectively. 


2 Problem Formulation and Background 


We consider drift diffusion processes given by a stochastic differential equations 
(SDE) 


dX, = f(X,)dt + V2o0dW,, (1) 


where X, : Q — R4, f R4 + Re oe Ro, and W; : Q > R4 denotes 
d, -dimensional standard Brownian motion [12, 13]. 

Assuming the law of X; is absolutely continuous w.r.t. Lebesgue measure with 
density ;, this leads to the Fokker-Planck equation [13] 


à; = —V - (m (f —oVlogz;)). (2) 


The SDE (1) can be replaced by the mean field ODE 


d. : - 
rt = f(Xi) — oV log ñ, (3) 


where ï, denotes the law of X,. Provided ño = zo, it holds that 7, = 7, for all 
t > 0. Note that the evolution of the random variable X; is entirely deterministic 
subject to random initial conditions X 0 ^v Io. 

At time t = T > 0, we have observations of the system according to 
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yr — h(x)) v (4) 


from which we wish to infer the unknown state a. € R4. Here h : R& > R” 
denotes the forward map and v ~ (0, R) is d,-dimensional Gaussian noise with 
covariance matrix R € R4» X4», 

Let L : R& —. R denote the corresponding negative log-likelihood function. 
Since v is Gaussian it is given by 


m _ T p=1 i 
Lœ) = Urt) yr) R (h(x) — yr) (5) 


up to an irrelevant constant. The observations are combined with the predictive 
density 7y at time t = T according to Bayes’ theorem, 
-L 
e AT 
f e ^ Vom T (x)dx 

The process of transforming the random variable X7 ~ xr into a random variable 
Xf ~ mj is called data assimilation in the context of dynamical systems and 
stochastic processes [10, 17, 8]. 

Since performing data assimilation can be difficult if the relative Kullback— 
Leibler divergence 


KLGr [13) = f , 7r GO og xr (x) — log a (x))dx, (7) 


also called the relative entropy [13], between the prior xr and posterior m7 is large 
and/or if the involved distributions are strongly non-Gaussian [19, 1], we propose 
to construct a new SDE with state process X” such that XE ~ so and x ~ Nm. 
In other words, we are looking for a stochastic process (bridge) with initial density 
zo and final density x}. The problem of finding the optimal process (in the sense of 
minimal Kullback-Leibler divergence) is known as the Schrodinger bridge problem 


[4]. 


3 Schródinger Bridge Approach 


The Bayesian adjustment (6) at final time t = T leads in fact to an adjustment over 
the whole solution space of the underlying diffusion process described by (1). Let 
us denote the so called smoothing distribution by xê, t € [0, T] [18, 5]. It is well 
established that these marginal distributions can be generated from a controlled SDE 


dX; = f (X7)dt + gf (X?)dt + V20dW, (8) 
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for appropriate control g? : R^ —. R4 such that Xj ~ nj implies X? ~ a? 
for all t > O. It is also well known that finding a suitable g? can be formulated 
as an optimal control problem which in turn is closely related to the backward 
Kolmogorov equation [13, 16]. Formulations related to (8) have also been used in 
the context of sequential Monte Carlo methods [5]. 

As proposed in [16], an alternative perspective on sequential data assimilation is 
provided by Schrödinger bridges. Given two marginal distributions qo and qr and 
stochastic process X; (referred to as the reference process), a Schródinger bridge 
is another stochastic process X ; such that Xo ^ qo, X T ^ qr and the Kullback— 
Leibler divergence between the processes {Xi }re(0.7] and {X;}+e[o,7] is minimal. 
Specialised to our problem and considering a single data assimilation cycle that 
means the marginals are the initial and posterior densities, i.e. gg = mo and qr = 
T. and the reference process is the solution to (1). The solution to the associated 
Schródinger bridge problem is again of the form (8) with modified control term 
denoted by pee (x). 

A Schrédinger bridge is thus the optimal coupling as measured by the Kullback— 
Leibler divergence to the underlying reference process. Unfortunately Schródinger 
bridges lead to boundary value problems in the space of probability measures and 
the required control term ee seems rather difficult to compute in practice. In 
addition to the computational complexity of solving nonlinear Schrédinger bridge 
problems, the target distribution 27 is implicitly defined in the setting of data 
assimilation. The next section offers a solution to both of these issues. We point to 
[23] for a discussion of alternative approaches which introduce appropriate control 
terms into data assimilation procedures. 


4 Homotopy Induced Dynamic Coupling 


Since Schródinger bridges are computationally challenging, we ask whether a less 
optimal but cheaper approach might also be feasible. Indeed, in the context of data 
assimilation a non-optimal coupling can be found via a homotopy between the initial 
and target distribution as follows. Let 


nP(x) = Z; le^ T1), (x) (9) 


denote the homotopy in question, with Z, — f e The ^n, (x)dx the time dependent 
normalization constant. It clearly holds that "m = mo and na = m7. Note that 


: Er: "EUM . 
the scaling t ++ e~ TL was chosen for its simplicity and follows previous work on 


Bayesian inference problems [7, 14]. Finding better homotopies or systematic ways 
of constructing one could be an interesting direction for future research. 

We can then reason backwards from the Fokker-Planck equation of z/ to 
conclude that if it is the density of a random variable X” then that random variable 
must satisfy the modified SDE: 
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t 
dX? = f(xbar — = V LOG dt + gi (XDdt + /2edW,, (10) 


where g; is a solution to the PDE 
h l h at h a Zi 
V^ Gr 8) = zn L+tVL.- = VEaViOEm TOS (11) 
t 


The derivations of (11) can be found in Appendix 1. 
Note that (10) constitutes a mean field model since g; depends on the distribution 
z? of X^. We also wish to point out that 


^SB ot 
Bp œ) = — VEG) + gi) (12) 


provides a solution to the associated coupling problem. The control term eo is 
however non-optimal in the sense of the Schrédinger bridge problem since it does 
not minimise the Kullback—Leibler divergence. 

Since (11) is linear in g; we can decompose (11) into a set of simpler equations 
V - (n! gl) = x (kt — Ek’) such that the k’ add up to the right hand side of (11). In 
order to maintain f V - (xf gi) = 0 for the individual gi we can make use of the fact 


that the terms in (11) are of the form zb (k — EK). Separating the terms, we obtain 
the following equations, the sum of whose solutions solves (11): 
L L 
V- (agl) = hf ow 13 
Tr 8, 7, T T (13a) 
Y. (ate) = 45 ( t Eu 
‘(1,97 ) =; VL.f -E-VL-f (13b) 
T T 
t $ 
V. (x?) = sag? (Zv: -V log z” — VL V log?) (13c) 
h4 h ot ot 
V- (x; s) -o—mTVLSVL-ETVL.VL). (13d) 
Note that 
z^ VL. V logn? = V (z?vL) — n AL, (14) 


which can be used to avoid the computation of V log zb (with A = V. V the 
Laplacian operator) in (13c). Thus the controlled SDE (10) can be replaced by 


dx! = f(xbdr — VEX dt + (X Pdt + V2edW,, (15) 
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where 2; is a solution to 
ha 1 h ot nZt 
t 


Furthermore, if AL is a constant (as a function of x) or small in comparison to the 
other contributions in (16), then (16) simplifies further. In particular, this is the case 
if the forward map is linear, that is, h(x) = Hx. 

Building upon the mean field ODE (3), one obtains the equivalent controlled 
mean field ODE system 


d - - ot - x ahs 
aoe = f(x) - pL) + gi (XP) — o V log &P (X1), (17) 
with g; defined as before. This mean field formulation again requires knowledge 
(or approximation) of V log m A Gaussian approximation might be sufficient in 
certain circumstances giving rise to 


V log £P(x) ~ - (E^! (x — ib, (18) 


where ji^ denotes the mean of X^ and XP its covariance matrix. 


5 Numerical Implementation 


No analytic solution to (11) is known and we thus have to resort to approximations. 
We note that a similar PDE arises in the computation of the gain in the feedback 
particle filter [24] and one could use the diffusion map based approximation [22] for 
the problem at hand. This method also transforms the PDE into a Poisson equation 
which it then translated into an equivalent integral equation, the semi-group form 
of the Poisson equation. As the name suggests the integral equation makes use of 
the generator of a semi-group which can be approximated by diffusion maps. Here 
we instead propose to follow the constant gain approximation first introduced in the 
EnKF methodology [21]. 


5.1 Ensemble Kalman Mean Field Approximation 


Let us assume that AL 7 const in (16). Then we only need to deal with the modified 
negative log likelihood function 


Lœ) = Z (Le +1VL(x)- (reo = E) (19a) 
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x 2 (1% de L D L (: Atf (x) + ar2-vLc)  ) (19b) 


with Af being the time-step also used later for time-stepping the evolution equa- 
tions (10) or (15), respectively. Since L is given by (5), we define the modified 
forward map 


h(x) =h (: — Atf (x) + Ar vLO)) (20) 


and thus 


L(x) © 


x i os Sy 

— yr)" Ra) - yr) - 54,5 AG) — yr) R RE) - yr). 

(21) 

Following the standard EnKF methodology for quadratic loss functions, this 
suggests to approximate the drift function g; in (15) as follows: 


, t+At_.,, 4/1 
KF(y) = — ET ERT! (5 (ro) + z?u]) - vr) (22a) 
xh l/; hrz 
4 F; R- G (ño) nm (i) = vr) (22b) 


Here we have introduced the oa I; h[/] to denote the expectation value El of a 
function /(x) under the PDF mL. Furthermore, yrn denotes the correlation matrix 
between x and A(x) under the PDF m etc. The derivation of (22) can be found in 
Appendix 2. 


5.2 Particle Approximation and Time-Stepping 


The controlled mean field equations (15) can be implemented numerically by the 
standard Monte Carlo Ansatz, that is, M particles X 09 are propagated according to 


ax? = f(xX)ar — 7 ^1 y L(X )dr + gl KF (OX dt + /25aw ? (23) 


fori = 1,..., M. The required expectation values in eee are evaluated with respect 
to the empirical measure 


a, ay 2» = xP), (24) 
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The interacting particle system can be time-stepped using an appropriate adaptation 
of (61) from Appendix 2. The computation of gradients can be avoided by applying 
the statistical linearisation (60). 


6 Examples 


We now discuss a sequence of increasingly complex examples. The purpose is both 
to illuminate certain aspects of the proposed control terms as well as to indicate 
the computational advantages of the proposed methodology. All examples will be 
based on linear forward maps h(x) = Hx and, therefore AL is constant and can be 
ignored. 


6.1 Pure Diffusion Processes 


We set the drift f to zero in (1) and also assuming Gaussian initial conditions. Then 
the control term (22) gives rise to the mean-field SDE 


2ot 
dX? = /2caw, — =H" R (HX? — yr)dt (25a) 
1 20t? 1 
hyzyT —1 —1 T p-1 h h 
shy [7^ -TRROHHIR | (5 Gti) -yr ) a 


(25b) 


in the limit At — 0. We note that XP ~ zt? will remain Gaussian for all times and 
we denote the mean by u? and the covariance matrix by XJ. Hence, it holds that 
X? = SD and z’ [x] = ub. 

Please note that the additional drift term in (25a) is pulling x towards the 
observation yr regardless of the value of = It should also be noted that the drift 
term in (25b) can be both attractive or repulsive with regard to the observation yr 
depending on the eigenvalues of 


i4. 206^ duo acd 
Q% =R! — L R'HH ges (26) 
T T? 


The strength of this drift term is moderated by the covariance matrix Di 

We consider a one-dimensional problem with R = 0.01, o = 1, H = 1, yr = 1 
and T = 1. The initial conditions are Gaussian with mean 449 = 0 and variance 
Xo = 1. It follows that z is Gaussian with mean jz; = 0 and variance X, = 2 and 
the resulting Gaussian posterior xf has mean and variance given by 
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Fig. 1 Time evolution of the 


pure diffusion 


mean mi and the variance xj — variance 


—— mean 


under the mean field 
equations (25). Their values 
at final agree with the 
posterior values provided by 
(27) 


ui = Kyr © 0.9524, uł 22 —2K 7 0.0952 (27) 


with Kalman gain K = 2/(2 + 0.01) ~ 0.9524. 

In Fig. 1 one can find the time evolution of the mean and the variance under 
the mean field equations (25). The early impact of the data driven control term on 
the dynamics is perhaps surprising and quite opposite to the standard sequential 
approach to data assimilation where one first propagates to final time and only then 
adjusts according to the available data. It is also worth noticing that the sign of the 
corresponding Q, changes sign at t, = 4/2/20 implying that the drift term in (25) 
has a destabilizing effect on the dynamics for t > te. 


6.2 Purely Deterministic Processes 


We now set o. = 0 in (1). We obtain from (22) the mean field ODE system 


d t+ At E 
au = f(xb- er, XhH'R (52 Qut) -») (28a) 
t olske-dgysu "T 
*PEDR E (AXD +2; (T) vr) (28b) 
with 
h(x) = Hx — AtHf (x). (29) 


These equations can be expanded giving rise to 


d 1 l af 
=X? = f(x!) - = [zP «ix | ATR ELIC EP) -») (30a) 
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t 
- qzEMH RH (SXM e PUE) (30b) 


upon ignoring terms of order O(Ar). Unless the drift function f is linear, these 
mean field equations provide only an approximation to the controlled mean field 
equations (15). 


6.3 Linear Gaussian Case 


It is instructive to investigate the linear case 

f@)=Fx+b (31) 
in more detail where again everything remains Gaussian provided xh is Gaussian 
distributed, that is zto(x) = N(x; uo, £o). Under these conditions the densities 7r; 


and z] will also be Gaussian, we write zh(x) = N(x; uP, XP). The associated 
mean field equations follow from Appendix 2 and are given by 


d = 2ot _ 
—X = FX" +b +0o(®7!(X® — pw) — — H! R7! (HX? — yr) — (32a) 


dt T 
1 
— C,H! R7! (54 (x? + i) — vr) (32b) 
t 1 
a rebate Hn (SF (xb ut) +0). (32c) 
with 
lou fer. 201^ cher pi 
Drev NEP T2 XH R H. (33) 
A qualitative discussion can be performed in the scalar case, that is d, = 1, 


H = 1,o = 1,b = 0, T = 1 and F = å. One finds that the control terms involving 
F stabilize the dynamics whenever à > 0. This observation is in line with the fact 
that the data is crucial only if the dynamics in X; is unstable, that is, A > 0. 

We consider a two dimensional diffusion process with state variable x = 
(x1, x2) | and linear drift term (31) given by 


—2 1 
s pae) (34) 


b = 0, and diffusion constant o = 0.17. The forward operator is H = (1 0) and the 
variance of the noise R = 0.01. The initial distribution was zp = A ((1, 3), 0.027). 
The observed value at time T = 1 is set to yr = 2.5. 
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Fig. 2 Left panel: Time evolution of the mean in x; and x2, the two associated variances and the 
covariance between x; and x2 under the linear diffusion process. Right panel: Time evolution of 
the same quantities under the controlled diffusion process 


The posterior mean takes values wu} ~% 2.25 and 45 œ 1.50, while the posterior 
covariance matrix becomes 


Y^ x e 5] l (35) 


~ \ 0.0039 0.0503 


Numerical results can be found in Fig. 2. The impact of the control term on the 
linear diffusion process can clearly be seen and is most prominent on the observed 
x1 component of the process. The final values of the controlled process agree well 
with their posterior counterparts. 


6.4 Nonlinear Diffusion Example 


We consider a two-dimensional problem and denote the state variable by x = 
(x1, x2)! . The drift term is given by 


A 2 4 Se 
f@®=-VV®), V@= = (x2 - 2+ Bx?) +> l-4 (36) 


with parameters A; = 2000, A» = 5, and B = 1/5. The diffusion constant is set to 
o = 1. The choice of the potential V (x) has two effects: (1) there is a relative high 
barrier for particles to pass from positive to negative x;-values and vice versa; (2) 
the dynamics stay close to the parabola x? = 2 — Bag. 

The initial distribution is obtained by sampling x, from a Gaussian with mean 
1.5 and variance 0.0625. The x2 component is obtained from the relation 
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Fig. 3 Initial (blue) and final 3 .. nonlinear diffusion problem —. 
particle positions (red) under ~ pend 
the given evolution process 25 :ze aa 
together with the posterior TUE 
approximation provided by "E 
the EnKF (yellow). The 8 
à $15 
observed value is also = 
displayed T 
0.5 
0 
-3 2 1 0 1 2 3 
x, variable 
zur 2 
x2 —2-— fx. (37) 
We observe the first component x; of the state vector at time T = 1 with 
measurement error variance R = 0.01. The observed value is set to yr = —1.5. 


Due to the tiny observation error the posterior is centred sharply about the observed 
value. Furthermore, recall that the dynamics is essentially slaved to the parabola 
x2 =2- px? which makes the inference problem strongly nonlinear. 

All particle simulations are run with an ensemble size of M = 1000. Essentially 
identical results are obtained for M = 100. Smaller ensemble sizes lead to 
numerical instabilities. 

In Fig. 3, one can find the particle distribution at time t = 1 which constitutes 
the prior distribution for the associated Bayesian inference problem. It is obvious 
that a particle filter would fail to recover the posterior distribution which is sharply 
centered about the observed value. We found that increasing the ensemble size to 
M = 10,000 allows a particle filter to recover the posterior distribution; but the 
effective ensemble size still drops dramatically. The approximation provided by the 
EnKF is also displayed. The EnKF fails to recover the posterior due to its inherent 
linear regression ansatz which is inappropriate for this strongly nonlinear inference 
problem even in the limit ensemble size M — oo. 

In Fig. 4, the results from the controlled mean field formulation are displayed. 
It can be concluded that the posterior distribution is well approximated despite the 
constant gain approximation made in order to formulate the control term gEF in (22). 


6.5 Lorenz-63 Example 


All examples so far have considered a single data assimilation cycle only. We now 
perform a proper sequential data assimilation experiment for the standard Lorenz-63 
model [11] 
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nonlinear diffusion problem nonlinear diffusion problem 


* initial * t-0.02 


o final t=0.2 
25r 9 reference value 25r t-0.4 |- 
^o 1-08 


x, variable 
a 


x, variable 


X, variable X, variable 


Fig. 4 Left panel: Initial and final particle positions under the controlled evolution process. Right 
panel: Particle positions at intermediate times fy € [0, 1] 


d 
— X, = f(Xi), 38 
act fO) (38) 
where X, : Q — R? and 
a(y — x) 
fœ, y,z)= | x(b-z)-y (39) 
xy — CZ 


with parameters a = 10, b = 28 andc = 8/3. 
In order to obtain a reference solution X i for t > 0, the ODE (38) is solved 
numerically with step-size At = 0.005 and initial condition 


(0.587276 
X, = | -0.563678 | . (40) 
16.8708 


Scalar-valued observations are generated every Afops > O units of time using the 
forward model 


YnAtgps = H Xn Atos + Vn, n=1,...N, (41) 


with measurement errors v, ~ N(0, 1) and forward map H = (100) € R!*3, 
We use Atobs € {0.05, 0.1, 0.12} in our experiments and perform N = 20,000 
assimilation cycles. 

The initial ensemble DE | is drawn from the Gaussian distribution with 
mean X i and covariance matrix 0.017. We employ multiplicative ensemble inflation 
which amounts to replacing the Lorenz-63 dynamics by 
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"LÍ XO = fX) eye qii.  i=1,...,M, (42) 


with inflation factors 
og = 0.025k, k=0,...,9. (43) 


Here ji; denotes the empirical mean of the ensemble {X d 1- These equations are 
combined with the augmented evolution equations (30) and solved numerically with 
step-size At = 0.005 and ensemble sizes M € (5, 10, 15}. 

We report the resulting root mean square errors 


RMSE — wÈ [OT LED APIS (44) 


n= 


which are computed for each ensemble size M, observation interval Atobs and 
inflation factor og. The results are displayed in Table 1 where the smallest RMSE 
over the range of inflation factors {ox}p—-0 is stated for each M and Atops. We 
also state the corresponding RMSEs from a standard ensemble square root filter 
implementation [2, 8]. We find that the proposed homotopy approach outperforms 
the ensemble square root filter in terms of RMSE in all settings considered. The 
improvements increase for increasing observation intervals Atobs. The homotopy 
approach also appears less sensitive to the ensemble size M. 

We close this example by pointing out that less of an improvement could be 
expected for a fully observed Lorenz-63 system. The proposed homotopy approach 
seems particularly effective in guiding the unobserved solution components to 
regions of high posterior probability. See also the example from Sect. 6.4. 


Table 1 RMSE for both a standard ensemble square root filter (ESRF) implementation and 
our homotopy approach in terms of ensemble sizes M € {5, 10, 15} and observation intervals 
Atobs € {0.05, 0.1, 0.12}. The homotopy based data assimilation method leads to significantly 
reduced RMSEs in all settings 


MIAtobs 5 10 15 

0.05 ESRF 0.57 0.56 0.57 
0.05 Homotopy 0.55 0.55 0.55 
0.10 ESRF 0.85 0.82 0.82 
0.10 Homotopy 0.77 0.76 0.77 
0.12 ESRF 0.96 0.95 0.94 


0.12 Homotopy 0.86 0.86 0.86 
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7 Conclusions 


Devising alternative proposal densities has a long history in the context of sequential 
data assimilation and filtering. Here we have explored a computationally tractable 
approach which combines the concept of Schródinger bridges with a rather straight- 
forward homotopy approach. A further key ingredient is the approximate solution 
of the arising PDEs in terms of a constant gain approximation, which is also widely 
used within the EnKF community. Numerical examples indicate that the approach is 
viable and can overcome limitations of both standard sequential Monte Carlo as well 
as standard EnKF methods. This has been demonstrated for single assimilation steps 
as well as long-time data assimilation using the chaotic Lorenz-63 model with only 
the first component observed infrequently. It remains to be seen how the proposed 
methods behave for high dimensional stochastic processes. 


Acknowledgments This research has been funded by the Deutsche Forschungsgemeinschaft 
(DFG)- Project-ID 318763901 - SFB1294. We thank Nikolai Zaki for earlier work on the topic 
of this paper. 


Appendix 1: Derivation of Control Term Equation 


Given an evolution equation 
dX, = f(X,)dt + /2edW, (45) 
we obtain the Fokker—Planck equation 
dt = —V- (n; (f —oV logm;)). (46) 
Now we modify (45) by an additional drift term, i.e. 
dX = f(xbdr + g, (Xb)dr + V20dW, (47) 


with g, : R^ — R&. In that case, we would get a Fokker-Planck equation for the 
new equation: 


aah = -v. (= (f +%-oV log?) ) Sey: (=P (s - oV log?) - V. (Pg). 
(48) 
We can find g; in terms of known quantities as follows: we begin by taking the 
derivative of z? with respect to time: 


Ż L ANC 
3n! = r? (2 4 =) + zm. (49) 
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Next we substitute (46) for 0:7; and use zt; = Z,et gh: 
h h Ż L -1,-4L 
Qm, = — T Z + T Z; e TN . Gu Cf — oV logm)) (50a) 
t 
Z L t f i 
= nh (2 + 7) Zi lg TL. (Zetta, (/ — oV log Z,ef nj) 
t 
(50b) 
Z L H t t 
= zt (2 + 7) e Tly. (t (r- evi -eVloea?)) 
(50c) 
og" Z 2b Ev f-o LVL -o0oyVYlo ab (50d) 
= tU Um t\. T 8T, 
t h t h 
- TU VL. Js VL -oV logn, . (50e) 
Comparing with (48) it follows that we require 
vulg; = Lrt L+tVL- fay Vise Ven? "m ae ster 
t T t T t I t T 
(51) 
For the Zi term we have 
Z i € 
Da de Tm, (x)dx (52a) 
1 L f " Li 
uS ——e TIO. (x) + e TE 9r (x) dx (52b) 
t 
1_ 1 _ £1 (x) 
—-——EL—-—[|eT V. (m (f —oVlogz;)) dx (52c) 
T Zt 
"E EL VL- (f — oV log z) dx (52d) 
T ZJ T : ! 
: VL : VL- (f Vl ) (52e) 
s GN r = o e 
T T Mid 
l t h t 


where the third equality follows from integration by parts and the expected value is 
with taken with respect to nb. We finally note that 
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gi(x) = gi(x) — ez NLG). (53) 


Appendix 2: Ensemble Kalman Filter Approximations 


We provide details on the derivation of the EnKF-like approximation (22) to the 
controlled mean field equation (10) and the various simplifications that arise from 
assuming a linear forward map. 

We first recall that a continuous time formulation of the EnKF for a generic 
likelihood function 


l^ A A 
L(x) = 50) - yr) R7' (A(x) — yr) (54) 


is provided by 


d LA 1 sa 
gX = ER (5 (ÂX) + mth) - yr). (55) 


Here 7; denotes the law of X;, z;[g] the expectation value of a function g under z;, 
and 


xj = m | @ -DÅ - mila)" | (56) 


is the covariance matrix between the state x and the forward map h. 
Formal application of this approach to the two contributions to the likelihood 
function (21) leads to (22). More precisely, the first term leads to h = h and 


pe t+ At R (57) 
^ 2AtT 
while the second term results in h = h and 
R=- (58) 
— T 
The EnKF makes use of statistical linearization 
ES m [Vh] = x; (59) 


which holds provided z; is Gaussian or if h is linear; a result known as Stein's 
identity. The identity can also be used to approximate derivatives in a (weakly) non- 
Gaussian setting giving rise to 
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Vh(x) (Z^) E, (60) 

We also recall the robust time-stepping method 

xh i M (Y( 
X, 7 Xn = At Ez (Arghh + R) 7 (Rau) mI) — yr), (n 


which again can be adjusted appropriately to (22). 
We now assume a linear forward map, that is h(x) — Hx, and discuss the 
simplifications that result in the computation of (22). Note that 


E t 
h(x) = Hx — AtHf (x) + At HH R^ Hx. (62) 
Hence the covariance matrix x can be reformulated to 
; t ' 
z?LIPH-AUDSH LACOEUH'ROHHU (63) 


and (22) simplifies to 


R 1 ofA 
r(x) = -7E H' R (5 (Hx + Hut) - yr) (64a) 
t x .1f1 
E pEUH'R (s (Hx + Hu?) - yr) (64b) 
OP oe etsrs ptt | h 
+ SSEPHTRHHTR 5 (Hx + Hut) -yr (64c) 
t 
= yE H RH (Sœ +27 1/1) (64d) 
Ler, on re d h 
+ IZE HTRHHTR 5 (Hx + Hut) -yr + O(AD) 
(64e) 
1 
=-C,HTR' G (Hx + Hp) - vr) (64f) 
t 7 
- g; PH RH (feo +x? Lf) ocn (64g) 
with 
G= toe 4 Lo -ZC pete in 65 
cS pe tg T? (93) 
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Upon dropping terms of order O(Af) and using X" = XJ, we obtain (25) for 
f = 0 and (30) for o = O as special cases. The mean field equations (32) also 
follow easily from f(x) = Fx + b. 
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Constrained Random Diffeomorphisms A 
for Data Assimilation or | 


Valentin Resseguier, Yicun Zhen, and Bertrand Chapron 


1 Introduction 


For ensemble-based data assimilation purposes, there is a definite need for relevant 
ensemble sampling tools. Indeed, the quality and spreading of these ensembles 
have deep implications in the quality of the data assimilation (Dufée et al 2022), 
and—until recently—those so-called covariance inflation tools have mostly relied 
on unsuitable linear Gaussian frameworks (Tandeo et al 2020, Resseguier et al 
2020a). A promising alternative is the generation of ensembles through a stochastic 
remapping of the physical space. 

Consider a random mapping T, acting at every infinitesimal time step, such that 
T, (x) — x is interpreted as a “location perturbation" expressed by 


T, (x) =x + a(t, x) At + ei(t, x) Ani(t), (1.1) 


where a(t, x), ej(t, x) € R”. In Eq.(1.1), a(t, x) controls deterministic location 
shifts, and An;(t) ^ JN (0, At) random ones. At every time step, this random 
mapping T shall induce a perturbation to any tensor field 0 (t) (Zhen et al 2022). 
For instance, one can perturb a differential form 6(t) applying 0(t) > T*0 (t) with 
T* the associated pull-back operator. 
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A rigorous mathematical definition and calculation of T; and T* can be obtained 
in terms of stochastic flows of diffeomorphisms and its Lie derivatives (e.g., 
Bethencourt De Leon 2021). Yet, to rapidly assess 7T,*0, a Taylor expansion and 
Itó's lemma can be used. Given coordinates (x!, ..., x"), when 0 is a differential 
k—form, it can be written as 


0— > fivc Jes ^ dx", (1.2) 


ij «...«ik 


with f a semimartingale smooth in space. Then 


To= Y» FEOT uas Ande, (1.3) 


ij «...«ik 
leading to a compact expression 
T0 —0 + M(t,0)At + Ni (t, O) Ani(t), (1.4) 


with some differential k—forms M (t, 0) and V; (f, 0). Appendix 4 provides defini- 
tions of M and N (see Zhen et al 2022, Appendix B for a full proof). 

Hereafter, we present and discuss the potential of this random mapping scheme 
to possibly prescribe 0, and the parameters a and e; to ensure that certain quantities, 
i.e. mass, vorticity, helicity, energy, are conserved. 

Several examples of 7;*@ can indeed be considered. For instance, when 0 = f is 
a function (differential 0—form), 


(7/0) =f + (aia, f ther el dcr dya f) At + eP dep f Ani. (1.5) 
 ——— — —— —— 
=M =N; 


And when 0 = f dx! ^--- A dx” (differential n-form), it then follows 
1/6 =| f + (Gira? + 340 + (a? + eo eoo f + delecta f) At 
+ Que? f + eP xn f) Ani fax! ^ A dx". (1.6) 
Finally, when 0 = f/dx/ = ae | f| dx is differential 1-form, we have 
T0 =| FI (aPAxr fi + de? eer dxa fi 0,;aP f? + yjet e Axa FPA 


+ (e? 8,» fi + a,se? f") Ami Jax! (1.7) 
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2 Induced Stochastic PDE 


From the expression of 7,0, a SPDE is derived from an original PDE, when 0 is a 
differential form. Suppose $^ is the full state variable of the deterministic dynamical 
system: 


asi 


I = gis). (2.1) 


Let f? be a component or a collection of components of 57. We then associate f? 
to a differential form 04, i.e. there is an invertible map F that maps the space of f? 
to the space of 07, such that F( f1) = 64. Typically, if Í d is a tracer, it is often 
associated to me 0-form 07 = f 4 If ff is the deno 04, we might associate the 
n-form 04 — od dxi! a- --Adx'^, More generally, 64 , and thus F, can be prescribed 
to ensure that certain quantities—such as mass, energy, circulation—are conserved 
(Zhen et al 2022, section 3.3). Consider the propagation equation for f 


af? = gf (sar. (2.2) 
It implies a propagation equation for 0: 
do? = g?(S^)ar. (2.3) 


We will now stochastically perturb the above deterministic dynamics. Let us denote 
S, f and @ the semimartingale solutions of this randomized dynamics. The proposed 
discrete-time perturbation at each time step consists of the following two steps: 


Ó(t + At) = O(t) + g° (S(t) At, (2.4) 

0(t + At) = T;0(t + At), (2.5) 

with T*O(t-- At) = = O(t+Ath+M(t, (AD) At+Nj (t, Ó(t-- At)) An; (f) -o( At) 
for the associated differential forms M(t, 0) and N; (t, Ô). 

The deterministic PDE (2.4) and NG + At) — 0(t)|| scales in O(At). There is no 

noise term to induce a scaling in O (v At). Therefore, it can be assumed that there 


exists C > 0 so that | M(t, Ó(t + AD) — M(t,0(0)|| < CAt and |N; (t, Ó(t + 
At)) — Ni (t, 9(£))]|| < CAt, for At small enough. Accordingly, 


T*Ó(t + Ar) =O(t + At) + (ma, O) + O(Ar)) At 


+ (Nile, 6) + O(AD) Ani(t) + 040), 


—ó(t + At) + M(t, OMAt + Ni(t, O(t)) Ani(t) + olt). 
(2.6) 
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Therefore, 


(t + At) = O(t) + g? (SOA At + M(t, 0(0) At + Nj(t, O(t)) An; + olt). 
(2.7) 


It suggests the following stochastic propagation equation for 0: 
do = g^ (S)dt + M(t, @)at + Ni (t, 0)dmi. (2.8) 


Since there is a 1-1 correspondence between 0 and f, Eq. (2.3) also suggests a 
stochastic propagation equation for f, which can be written as 


df = g/(S)àt + Mf (f)àt +Nf (fan. (2.9) 
We denote the additional terms in Eq. (2.9) by 
d, f := MF (f)àt + NJ (Pani. (2.10) 
Then Eq. (2.9) can be written as: 


df = g/(S)dt + d; f. (2.11) 


3 Comparison with Other Perturbation Schemes 


Obtained above, ds f is completely determined by 7*6, but is not directly related to 
the original dynamics Eq. (2.2). Once the expression of T in Eq. (1.1) and the choice 
of the differential form 07 are determined, the perturbation term d, f is prescribed. 
However, the choice of 04 is up to the user, and may then be related to the original 
dynamics. 

In the following, we thus demonstrate that both the stochastic advection by Lie 
transport (SALT) equation (Holm 2015) and the location uncertainty (LU) equation 
(Mémin 2014, Resseguier et al 2017; 2020b) can be properly recovered using the 
proposed perturbation scheme. 


3.1 Comparison with the LU Equations 


The Reynolds transport theorem is central to the LU setting. The Reynolds transport 
theorem expresses an integral conservation equation for the transport of any con- 
served quantity within a fluid, connected to its corresponding differential equation. 
A link between the proposed perturbation approach and the LU formulation can 
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be anticipated to be related to differential n-forms. But first, we consider a key 
ingredient of LU: the stochastic material derivative of functions (differential 0- 
forms). 


3.1. 0-Forms in the LU Framework 


Dropping the forcing terms, LU equation for compressible and incompressible flow 
reads (Resseguier et al 2017) 


9, f -w* - Vf =V- GaV f) - o B. Vf, (3.1) 


w* =w — 4(V- a)! t oe(V.o)!, (3.2) 


where a = Tol, and f can be any quantity that is assumed to be transported by 
the flow, i.e. Df/Dt = 0 where D/Dt is the Itó material derivative. For instance, 
f could be the velocity (dropping forces in the SPDE), the temperature, or the 
buoyancy. 

Separating the terms of the SPDE related to the deterministic dynamics from the 
term associated to the stochastic scheme, it comes 


dl" f = gf (S)at + al" f, (3.3) 

where 
g/(S)=-—w-Vf, (3.4) 
alU f = — (w* — w)- V fdt -—odB- Vf + V- (aV f)dt. (3.5) 


Besides, from our proposed scheme applied to a 0-form 0 = f (Eq. (1.5)), we 
obtain: 


df = (a^à.; f t+ deh el ap dea f)at + e” dep f án. (3.6) 


To physically interpret this equation, we rewrite: 


d 
af = —VP 3yo f + dyp (Gepe dxa f) , (3.7) 
where 
d s. 
V? = —a? + 1o (eP e) — eP E (3.8) 


Terms of advection and diffusion are recognized. The matrix Jeje! is symmetric 


non-negative and represents a diffusion matrix. The p-th component of the advect- 
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ing velocity VP is composed of the drift —a?, a correction Faxa (elei), and a 
E . . p dni 
stochastic advecting velocity —e; 4+. 


Direct calculation yields that Eq.(3.5) coincides with Eq. (3.7) when a = 


NT EM = ejeT ando B = —e;dn; and 
T(x) =x+ e] s e; At +e An = x — wSAt + (-wS At — o AB), (3.9) 
where 
w$ = —i (oek: Vox — —4(V- a)! + 50(V-o)!. (3.10) 


The LU equation can thus be derived by choosing 0 = f and T; by Eq. (3.9). Note, 
the term (—w$ At — o AB) = (5618s, ei At + ei Ani) is the Itó noise plus its Itó- 
to-Stratonovich correction. Hence, it corresponds to the Stratonovich noise e; o drj 
of the flow associated to 7;. The additional drift —w At is different in nature. It 
is related to the advection correction w$ - V f in the LU setting. Indeed, in the LU 
framework, the It6 drift, w, is seen as the resolved large-scale velocity. That is why, 
in this framework, the deterministic dynamics (3.4) involve the It6 drift, w. This 
is also the reason why, under the LU derivation, the advected velocity is assumed 
to be given by the Itó drift, w. It differs from the Stratonovich drift ws = w + 
w$, used as advected velocity in SALT approach or in Mikulevicius and Rozovskii 
(2004) (where the Stratonovich drift is denoted u). Interested readers are referred 
to (Resseguier et al 2020b, Appendix A) for a discussion on these assumptions and 
for the complete table of SALT-LU notations correspondences. Note however that 
in all these approaches, the advecting velocity is always the Stratonovich drift. This 
can be seen e.g., in the Stratonovich form of LU equations, derived in (Resseguier 
2017, Appendix 10.1) and (Resseguier et al 2020a, 6.1.3): 


9, f -ws- Vf =- (o o B). Vf, (3.11) 


where o o B is the Stratonovich noise of the SPDE. Since the advecting velocity ws 
and the resolved velocity w differ by a drift wç, the term w$ - V f is interpreted as 
an advection correction, being part of the stochastic scheme (3.5). Accordingly, the 
remapping 7; involves an additional drift —w§ At . 

To also understand (3.9), the inverse flow can be considered: 


T! (x) =x — ei Ani =x +0 AB. (3.12) 


If T, represents a necessary perturbation to match, at each time step, a true solution, 
(o3 measures the difference, at each time step, between this true solution and 
a model forecast. Therefore, the LU equation can be derived using the proposed 
perturbation scheme, choosing 0 — f and assuming that a true solution differs from 
a model forecast by a displacement prescribed by Eq. (3.12). 


Random Diffeomorphisms 287 


3.1.2 n-Forms in the LU Framework 


The LU physical justification relies on a stochastic interpretation of fundamental 
conservation laws, typically conservation of extensive properties (i.e. integrals 
of functions over a spatial volume) like momentum, mass, matter and energy 
(Resseguier et al 2017). These extensive properties can be expressed by integrals 
of differential n—forms. For instance, the mass and the momentum are integrals of 
the differential n—forms pdx! ^ --- A dx" and pwdx! ^ --- ^ dx", respectively. 
In the LU framework, a stochastic version of the Reynolds transport theorem 
(Resseguier et al 2017, Eq. (28)) is used to deal with these differential n—forms 
6 = fdx! A--+ ^ dx". Assuming an integral conservation 4. Ivo f =Oona 
spatial domain V (t) transported by the flow, it leads to the following SPDE: 


Df g d t t . 
v wtb) Ds. f Veo) = ee) ou! f 
0 0 


Dt 
(3.13) 
where D/ Dt denotes the Itó material derivative. Forcing terms are dropped for the 


sake of readability. This SPDE can be rewritten using the expression of that material 
derivative (Eq. (9) and (10) of Resseguier et al (2017)): 


af +V - (ws f) 21V - (aV f)  1V - (o. (V 0,07 f) - V (o Bf). 


(3.14) 
The original deterministic equation and stochastic perturbation correspond to 
a! (S) «— V - (wf), (3.15) 
aj" f -V - (w$ f) 3V - (aV f) 3V - (eu (V «0, f))dt- V - (CAB f), 
(3.16) 
=- V. ((-GV - a)! dt + adB)f) + V - GaV f)at. (3.17) 


We can now compare these LU equations to our new stochastic scheme applied 
to n-form 0 = fdx! ^: ^ dx" (Eq. (1.6)). This implies that 


ds f - (Gira? + 340 f + (a? + cP Acne! xe f + Tele Axvdxa f)àt 
+ (Bye? f + e? yp f)àni, (3.18) 
where J; = Oy» e? dxa el — ðyr ef Oxa e? . Rewritten, it leads to: 


ds f 
dt 


= — dar (V? f) + div (GeleDàs f). (3.19) 
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where 


dni 
V? = V? — (ede?) = —a? + 5 (Br0e? ef — ep dese) — ep 


(3.20) 
Again an advection-diffusion equation is recognized, but of different nature. Indeed, 
as expected for an n-form, the PDE is similar to a density conservation equation. 
Moreover, the advecting drift is slightly different to take into account the cross- 
correlations between f (T; (x)) and T*(dx! ^c A^ dx"). 


Identifying a — oo! = ever and o B = —e;dnj, 
V=-aP+ 5 (dae? e] — eL soe?) — E = -GV - a)? +oB, (3.21) 
i.e. 
a? = 5(dxaeP ed — eP Axaet) + 5x0 (el ef) = ef oae]. (3.22) 


A remapping is thus obtained to write 
T(x) =x + e 0s ei At + ei Ani = x — wSAt + (-wS At — o AB), (3.23) 


already derived for differential 0—form in LU framework (Eq. (3.9)). Therefore, 
the proposed perturbation mapping can also encompass the LU framework for n— 
forms, and its capacity—given by the Reynolds transport theorem—to deal with 
extensive properties. 

Moreover, for incompressible flows, LU equation further imposes that 


V-o —0, 
id (3.24) 
V. V.a —0. 
Translating it into our present notation, it reads as 
P . 
0x, € oe each i (3.25) 
Ox, Ox, (e; ej ) = 0. 


Following straightforward calculation, Eq. (3.24) is found equivalent to that 7,0 = 
0 for 0 = dx! ^--- A dx”. Such a result is expected since constraints Eq. (3.24) are 
obtained from the LU density conservation. 
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3.2 The SALT Perturbation Scheme 


Holm (2015) derived the original SALT equation following a stochastically con- 
strained variational principle 6S = 0, for which 


| Su, q) = f t(u, q)àt, (3.26) 


dq + £axq = 0, 


where £(u, q) is the Lagrangian of the system, £ is the Lie derivative, and x; (x) is 
defined by (using our notation) 


t t 
Xi(x) = xo(x) «f u(x,s)ds — Í ei (x) o dni (s), (3.27) 
0 0 


in which u is the velocity vector field. The o means that the integral is defined in the 
Stratonovich sense, instead of in the Ito sense. Hence, dx; = u(x, t)dt — ej o dni 
refers to an infinitesimal stochastic tangent field on the domain. We can express 
dx; = T; (x)—x--udt. Note the difference between Ito's notation and Stratonovich’s 
notation, i.e. e; o dn; 4 e;dn;. The initial expression of T; essentially follows Ito's 
notation. In this subsection, it comes that 7; (x) zz x — ei Ani. Instead, it becomes 
T(x) =x + Tel xei At — ej Ani. 

In the second equation of Eq. (3.26), q is assumed to be a quantity advected by 
the flow. q can correspond to any differential form that is not uniquely determined by 
the velocity (since the SALT equation for the velocity is usually determined by the 
first equation of Eq. (3.26)). Holm (2015) evaluates the Lie derivative £ax,q using 
Cartan's formula: 


£axq = d(iaxq) + iax,dq. (3.28) 


This Lie derivative £4,,q corresponds to T,*q — q + f^(S)àt, if we assume that 
the deterministic forecast of q is simply the advection of q by u. More generally, 
£ax,-uarq = T;*q — q. Therefore, the SALT equation for q is the same as our 
perturbation for q. Note, the Cartan's formula can not be directly applied to calculate 
the Lie derivative if the expression of dx; is in Ito’s notation. 

Within the SALT setting, the velocity u comes from the first equation of 
Eq. (3.26). For most cases, the velocity u is associated with the momentum, a 
differential 1—form m = u/dx/ = uldx! + ... + u" dx". When the Lagrangian 
includes the kinetic energy, Holm (2015) observed that the stochastic noises 
contribute a term £a;,0, where 0 is a differential 1—form related to the momentum 
1—form. In particular, 0 = m for the “Stratonovich stochastic Euler-Poincaré flow” 
example, and 0 = m+ R/dx/ for the “Stochastic Euler-Boussinesq equations of a 
rotating stratified incompressible fluid". 


290 V. Resseguier et al. 


Already pointed out, the operator £a;, is closely related to T*, and the SALT 
momentum equation can thus also be derived using our proposed perturbation 
scheme by properly choosing 0, without relying on Lagrangian mechanics. 

Another way to appreciate the correspondence to SALT is by looking at the final 
SPDE. If we choose 0 to be a differential 1-form to represent the momentum f, i.e. 
0-—f Jdx/J we obtain from Eq. (1.7) (more details in Zhen et al 2022): 


d, fÍ = (a? Axo fÍ + xe? ef Ayr dua fi + 8,;a? f" + 0 je ef Axa f ")àt 
+ (eP 8» fÍ + 9,; e f? Ani. (3.29) 


Regrouping the terms for physical interpretation, it writes: 


ds fi 
dt 


= -VP Av fÍ + dav (GeleDs f!) 
+ yj (« Pus Qr en x) f? -- ü je et aya fP. (3.30) 


Two last terms of the right-hand side complete the advection-diffusion terms, 


already appearing in (3.7). The first one, 0,/ (-a" — ep St 


x) fP, is reminiscent 
to the additional terms appearing in SALT momentum Eton: (Holm 2015, 
Resseguier et al 2020b). The second term, —O,;e e? Axa f?, comes from cross- 


correlation in Itó notation. 


4 Conclusion 


As demonstrated, both SALT and LU equations can be recovered using a prescribed 
definition of a random diffeomorphism 7; used to perturb the physical space. 
However, compared with SALT and LU settings, the proposed perturbation scheme 
does not directly rely on a particular physics. Hence, the random mapping is more 
flexible and can be applied to any PDE. Interestingly, similarities and differences 
can then be identified and studied between the proposed use of the random 
diffeomorphism and the existing stochastic physical SALT and LU settings. For 
instance, the proposed derivation provides an interesting interpretation the operator 
£ax,—uar, appearing in the SALT equation. This term can indeed represent an 
infinitesimal forecast error at every forecast time step. 

To apply the proposed perturbation scheme to any specific model, the diffeomor- 
phism parameters a and e; must be determined specifically. Hence it is necessary to 
learn these parameters from existing data, experimental runs, or additional physical 
considerations. This framework naturally provides new perspectives to generate 
ensembles through constrained stochastic mappings applied in the physical space. 
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Appendix: Expression of T*0 


Given coordinates (x!,..., x") and a differential k—form 0, Zhen et al (2022) 
(Appendix B) proof that: 


T*0 = f (T; X))T, (dx ^--- ^ dx'*) 


Ton i i 
-64 1 (WV fia) +56 Hye; + IW)dx" A+++ A dx" 


k 
+) füua^dx^ ^. dx] ^A dx" 


s=1 


+> fue Quel dx! aa dxi ^ adx! A+++ Adx't) 
S«r 
k . 
+ OR ee ei)ü,;e dx! A+++ A dx! A+++ A dx*)} At 


s=1 


a [vs e;)dx!! A... A dx” 


k 
+Y fa,ie dx" ^^ dxl ^ c^ dit] Ani 


s=1 


+ 0(At), (A.1) 


where IW is the additional term appearing in the Itó-Wentzell formula (Kunita 
1997). Here, there is no noise in the original dynamics (2.3) (the first step (2.4) 
of the randomized dynamics) which could be correlated with the noise of the 
resulting stochastic scheme (2.5). That is why TW = 0O in the above Taylor 
development of f. Indeed, there is no additional cross-correlation term between T* 
and 6(t + At) = O(t) + g?(S()) At . The final SPDE (2.8) makes clear the link 
between the solution 0 and the Brownian motions rj. But, at a given time step f, 
since (2.2) has no noise term, 6 (t + At) is correlated with the t’ — n;(t') for 
t < t only, and is independent of the new Brownian increment An; (f) generating 
T;. Therefore, there is no cross-correlation term between T;* and à(t + At). 

To simplify Eq. (A.1), wedge algebra is applied and the high-order infinitesimal 
O(At) is ignored. Accordingly, 7,*0 is more compactly written as 


T^0 = 0 + M(8)At + N (0) Ami, (A.2) 


for some differential k—forms M (0) and M; (0). 
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Stochastic Compressible Navier-Stokes A 
Equations Under Location Uncertainty or | 


Gilles Tissot, Étienne Mémin, and Quentin Jamet 


Abstract The aim of this paper is to provide a stochastic version under location 
uncertainty of the compressible Navier-Stokes equations. To that end, some clar- 
ifications of the stochastic Reynolds transport theorem are given when stochastic 
source terms are present in the right-hand side. We apply this conservation theorem 
to density, momentum and total energy in order to obtain a transport equation 
of the primitive variables, i.e. density, velocity and temperature. We show that 
performing low Mach and Boussinesq approximations to this more general set of 
equations allows us to recover the known incompressible stochastic Navier-Stokes 
equations and the stochastic Boussinesq equations, respectively. Finally, we provide 
some research directions of using this general set of equations in the perspective of 
relaxing the Boussinesq and hydrostatic assumptions for ocean modelling. 


1 Introduction 


Stochastic modelling under location uncertainty (LU) relies on the decomposition 
of the displacement of fluid particles into a time-differentiable velocity field, and a 
highly fluctuating component represented by a Brownian motion. It was proposed 
in [15] to apply this principle to fluid flows, leading to a stochastic version of the 
Navier-Stokes equations. On this basis, a similar derivation has been performed 
for various ocean models, such as Boussinesq models [16], quasi-geostrophic (QG) 
models [4, 14, 17], surface quasi-geostrophic models (SQG) [18] and shallow water 
equations [5]. 
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In the ocean, variations of density, temperature and salinity are of great impor- 
tance. In the previously cited models the Boussinesq assumption of small com- 
pressibility has been assumed from the start. This is a fair approximation, but 
it can become limiting, for instance when radiative transfers heat the ocean 
surface. Some research efforts have been performed to account for compressibility 
in deterministic oceanic flow models [20, 9, 8] in order to obtain energetically 
consistent formulations. Another key aspect is that, Boussinesq models cannot 
sustain acoustic waves, which is relevant for two major applications: (i) ocean 
acoustics and (ii) numerical simulations of non-Boussinesq models, where pseudo- 
compressibility strategies [7] are employed to compute the pressure with explicit 
schemes without having to solve an expensive 3D Poisson equation [3]. In addition, 
a rigorous development of Boussinesq systems requires to perform the Boussinesq 
approximations on the compressible equations [22]. 

The derivation of a compressible stochastic system cannot be derived from the 
incompressible stochastic system, since it corresponds to a generalisation step. 
We propose in this paper to start from the classical physical conservation laws 
to derive a general stochastic compressible Navier-Stokes system. We verify that 
the provided set of equations is consistent with the incompressible stochastic 
models previously developed. We will moreover theoretically show some potential 
developments enabling to perform a relaxation of the Boussinesq assumption. Such 
a procedure will allow us to propose stochastic systems of increasing complexity 
lying in between Boussinesq hydrostatic system and a fully compressible flow 
dynamics. 

The paper is organised as follows. In Sect. 2, we briefly recall the LU formalism 
and provide a convenient form of the stochastic Reynolds transport theorem when 
the budget of conserved quantities is balanced by external source or flux terms of 
stochastic nature. In Sect. 3 we develop the stochastic compressible Navier-Stokes 
equations. In Sects. 4 and 5, the low Mach number and Boussinesq approxima- 
tions are performed respectively. We verify in these two sections the consistency 
with stochastic models previously derived from stochastic isochoric models [16]. 
This stochastic Boussinesq model is generalised by incorporating thermodynamic 
effects. In Sect. 6 these approximations are relaxed and we propose a model which 
can be integrated explicitly in time, similarly as in [3]. In Sect. 7 some concluding 
remarks are given. In appendix, technical calculation rules and important details to 
perform energy budgets are provided. 


2 Stochastic Reynolds Transport Theorem 


The transport of conserved quantities subject to a stochastic transport is described 
by the stochastic Reynolds transport theorem (SRTT) introduced in [15]. When 
stochastic source terms are involved in the budget, additional covariation terms have 
to be taken into account [16]. These terms are usually defined in an implicit manner. 
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In the present section, we briefly present the modelling under location uncertainty, 
and we rewrite the SRTT in a convenient form for further developments. 

In the modelling under location uncertainty [15], the displacement X (x, t) of a 
particle is written in a differential form as 


dX (x, t) = u(x, t)dt + o;dB,, (1) 


where u — (u,v, w)? is a time-differentiable velocity component, and dB; is 
the increment of a Brownian motion, whose aim is to model unresolved time- 
decorrelated velocity contributions. The correlation operator o; is an integral 
operator which involves a spatial convolution in the domain £2 with a user-defined 
correlation kernel ð, such that 


(c,dB;) (x) = f & (x, x' Bl (x^ dx’. (2) 
2 


Associated with o;, we define the (matrix) variance tensor a (that corresponds to 
the one point covariance tensor) such that 


aij (x)dr = E ((0dB,)' (x) (eB) (x)). (3) 


Within this framework, the stochastic transport operator of a scalar quantity q is 
defined by 


1 
Dq drq + (u- jv ato 0n) V q dt 
(4) 


~ 


u* 


1 
+ (c;dB; - V)q — 2"" (aVq) dt, 


where u*, called drift velocity, is the resolved velocity corrected by the inhomogene- 
ity and divergence of the noise correlation tensor, respectively. Physical relevance 
of the drift velocity and the stochastic diffusion IV - (aVq) dt has been extensively 
highlighted in previous studies [e.g. 6, 4]. 

Variation of q integrated over a transported volume [16] can be written 


af q dx =Í (Diq + qV - (u* dt + o;dB;) + V-(o;h) dt) dx, (5) 
Rt) Q(t) 


with h defined as follows: when the stochastic transport operator is isolated on the 
left-hand-side (LHS), h is associated with the martingale part of the remaining right- 
hand-side (RHS): 


ha = fdt-+h-dB,. (6) 
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Starting from (5), we can assume that some source terms Q; dt + Q, -dB;, with 
a time-differentiable and a martingale contribution respectively, are balancing the 
budget of q in the control volume such as 


af qd = [ (QO, dt + Q, -dB,) dx. (7) 
Q(t) Q(t) 


These RHS terms correspond to forces (resp. work) when this general expression 
is associated with the momentum (resp. energy) equation. Dropping the volume 
integral, we can now identify h 


h-dB, = —qV -(o;dB;) + Q, -dB:. (8) 


This leads to the explicit expression of the stochastic Reynolds transport theorem 


1 1 
dig ys ((« -3V-8)d toan) a) +V- (0, Q,)dt — 5V - (aVq) dt 


= Q: dt + Q,-dB;. 

(9) 
The absence of the term o; (V -o;) in the modified drift is an important feature of 
this expression. It has been cancelled (not neglected) by accounting for the term 
—q V -(o;dB,) in Eq. (8). As it will be detailed further, this term will reappear when 
we will transform the conservative form of the equations to their associated non- 
conservative form, i.e. writing a transport equation for the primitive variables. For 
consistency checking, it has been assessed in appendix 7, that the same expression 
is obtained using a Stratonovich stochastic integral convention. 


3 Stochastic Compressible Navier-Stokes Equations 


To obtain the stochastic compressible Navier-Stokes equations we apply the SRTT 
equation (9) to the mass, momentum and total energy. This requires at first to 
properly define the physical variables. 


3.1 Non-dimensioning 


We consider the time t, x = (x, y, z)! the space coordinates of 2, and (ex, €y, €z) 
the associated canonical basis. Physical quantities are marked by e? and the other 
quantities are non-dimensional. We adimensionalise by reference conditions (noted 
e), and introduce a reference distance Lr, velocity U, density p.r sound speed 
c,, AS Well as viscosity M, We get 
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xÝ 1? Ures u? g" Uses 
x= — , = , Se , CL ; M = , 
Let La U ver User Cret 
$ $ [ T? $ 
p H P C 
pex dw que. ar qus -—56 ; (10) 
Pret Href Pret Ue rep Uet 
c$ e? T 2°? Lis 
Maz sae de Stee aw BS 3" 
Cy Ut y Dyer 


with u the velocity vector, c the speed of sound, M the Mach number (i.e. the ratio 
of typical particle speed to typical sound speed), o the density, p the pressure, u 
de dynamic viscosity, T the temperature, y the heat capacity ratio, (cp, cy) the heat 
capacities at constant pressure/volume, e the internal energy and g — —ge; the 
acceleration vector due to gravity. We introduce as well the Reynolds and Prandtl 


numbers 
$ 10 
Tel Ure Ds C 
Re= Pret ret ret >: Pr= pH , (11) 
Heret k$ 


with ke the thermal conductivity. 


3.2 Continuity 


Mass conservation ensues upon applying the SRTT on density, i.e. q = o with no 
mass source of any kind: 


So V ((w- 5v maneo) p) 5v voa (12) 


3.3 Momentum 


Applying now the SRTT to the momentum pu; balanced by forces, with u; € 
{u, v, w}. 


1 * 
di(oui) + V+ ((« EUN eB) di + dB) pui) + V * (o; F2") dt 


dp; 
Ox; Ox; 


1 rjj (u) 1 dT; (o;dB;) 1 
dt V-(av ;)) dt, 
ou V UNI ae 9 ph EN DIADD 


(13) 
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o 
with FS" . dB, = — ir + EV - (t;(0;dB,)). The forces involved here are 
L 


caused by pressure gradient, viscous stresses T and gravity. The pressure gradient 
is decomposed in a time-differentiable part p dt and a random component dp? . For 
sake of generality, we consider the molecular viscosity stress tensor: 


ru) = u (Vu + (Vu)") + (m - gu) Vul (14) 


with up the bulk viscosity. Similarly to the pressure, there is a finite variation friction 
contribution due to u dt and a martingale contribution due to o;dB;. 

After some manipulations and using the stochastic distributivity rule (70) given 
in appendix 7, we obtain 


p Vrti 


] ! 9 f(l( dp 1 uod) 
d dB,)*, s JA 
TE Af p(osdBs) i am (<( d T Re ax; ) 


Op dt üdp? " 1 ðt; (u) P 1 dt;;(0;dB;) 
Ox; OX; Re 0x; Re Ox; 


089i e. . 
(15) 


This expression is very similar to the momentum equation of the incompressible 
Navier-Stokes equations [15, eq. 41 with incompressibility assumption]. We only 
have as an additional term the covariation between (the martingale part of) forces 
and the small scale component po;dB;. This term, usually difficult to evaluate 
analytically is generally neglected through a slight variation of the expression of 
Newton's law in the LU framework, as for instance in [16, Appendix E]. 


3.4 Energy 


As in the deterministic framework [23, 2, 13], we now consider conservation of the 
total energy and deduce a transport equation for the temperature. 


General Formulation 
Work of forces and heat fluxes acting on a transported control volume induce 
variations of total energy E such that: 


d;(pE) 4- V- ((« = iv - a) dt *odB;) 2 t V. (o,FPP) 
(16) 


1 
= 7d - (aV(o E)) dt 4- dW — V - (dq), 
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with dW and dq the elementary work of the forces and heat fluxes detailed later. 
The martingale part of these RHS terms is written F d * dB;. 


Using (70) and the continuity equation (12), we obtain 


t t 
emo) + af pe dB)", | s; (; Fi aB, )) = aw - v tp. 
i 0 o Oxk \p 


(17) 


Definition of the Energy 

At this point, the form of the total energy has to be specified. It is strongly related to 
the physical mechanisms at play. In the present study, we consider the total energy 
pE = ple + 5 llul? + gz), as the sum of internal energy e = T kinetic energy 
and potential energy due to gravity. We do not consider the energy of the Brownian 
motion since it is possibly infinite. 


Definition of the Work of Forces and Heat Fluxes 

The work of the time-differentiable pressure represents how pressure is working 
with the displacement of the control surface. The expression can be obtained by 
integrating the force multiplied by the surface displacement over a transported 
control volume and applying Green’s formulae. The procedure is similar to the 
deterministic framework, with the additional implication of the drift velocity, as 
demonstrated in appendix 7. We have for the pressure work: 


f dW, dx =f (—pndS)- (u*dt + o;dB;) 
Q(t) ôN (t) (18) 
=- f V » (p (u*dt + o;dB;)) dx. 
Q(t) 


The minus sign comes from the outward normal n convention. We can then identify 
dW, = —V - (p (u*dt + o;dB,)). (19) 

In the same way, the viscous stress of the resolved component can be written 
dW, = >v - (t (u) (u* dt + o;dB;)). (20) 


Following Appendix 7, we take as well into account the work of the random 
pressure: 


dW,» = —V - (u*dp?), (21) 


and the work of the random viscous stress 
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dW,, = ly . (r(c;dB,)u*). (22) 
Re 
As rigorously detailed in appendix 7, we do not consider work of random forces 
associated with o';dB;, since such a work would be highly irregular (in time) and 
should be in balance with variations of kinetic energy of o;dB;, which is possibly 
infinite and not described by the present model. 

There is no work contribution of gravity on total energy, since the gain in kinetic 
energy directly associated with the gravity force is compensated by the loss in 
potential energy. 

Finally, we obtain the thermal conductivity by expressing the thermal fluxes by 


the Fourier law dg = — pap VT dt. 


Transport Equation of Temperature 

By replacing the energy by the contributions of internal, kinetic and potential energy, 
and by subtracting the contribution of the kinetic energy using the momentum 
equation (15) and the distributivity rule (70), we obtain the transport equation for 
the temperature 


t t à 
oor + doa f pe dB, f az (ZF: osans) 
y : 0 o Ox, Vp 


~ 


Or 


p — em 
DL (f Fu dB, | Fu sd.) 
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Qu 
= —pV - (u*dt + o;dB;) - dp? V -u* 
————— aM 
P, Ps 


1 1 
+ gU) DM (u* dt + c;dB,) + pe or B:) : Vu* (23) 
e e 


—————— 
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1 
+ ((u* — u) dt c o;dB;) . (-v»* p r T(u) +g) 
a— a oO 
D, 
" 1 
t (u* —u)- (-vavr + me . ris.dB,)) 
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1 
+ ——V.-(VT)dt. 
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with F# = LE" and LF A - dB, the sum of all martingale terms of the RHS of 
Eq. (23). In Eq. (23), we recover the terms present in the deterministic framework, 
but considering the stochastic transport operator instead of the deterministic trans- 
port operator. Nevertheless, some covariation terms are now arising. In particular 
the term Qr is induced by the random work of the forces, and the term Q, is 
induced by the increase of kinetic energy through covariations of the forces in the 
momentum equation. On the RHS, we remark that the drift velocity is involved in 
the work of the time-differentiable pressure (in P;) and random pressure (in P5), 
consistently with Appendix 7. The terms V; and V, are smooth in time and random 
viscous stresses, respectively. In addition, the terms D; and Do correspond to works 
caused by the alignment between the drift and random velocities with the pressure 
gradient and viscous forces. We call them drift works. Focusing on — (u* — u) - V p, 
we interpret this drift work to be related to baropycnal work [1], present in the 
compressible large-eddy simulation framework. Indeed in standard compressible 
LES, baropycnal work corresponds to a contribution caused by the alignment 
between the large scale pressure gradient and the Reynolds stresses induced by 
product between the small scales contributions of o and u (i.e. ivp - p'u', with 


" denoting here small scale components and = large scale filtering). This Reynolds 
stress Lp/u' has the dimension of a velocity. In our case, the interpretation of the 
effective displacement associated with this work is directly the drift velocity over 
dt. Similar interpretations can be made for the other drift work terms, associated 
with viscous stresses and random variables. The presence of gravity in the drift 
work D, shows that in the vertical direction the time-differentiable drift work is of 


the form (w*— wy 2? — pg), and we see appearing the vertical small-scale mass flux 


times the buoyancy (w* — w)(pob), plus non-hydrostatic pressure effects. It can be 
noticed that for a divergence-free homogeneous noise (for which the variance tensor 
is constant in space) the drift work is null as u* — u cancels. 


3.5 Equation of State 


In order to close the system, we have to specify the equation of state. We keep 
generality and write the equation of state formally as follows 


p= f,T). (24) 


As in the deterministic framework, since we have an evolution equation of density 
and temperature, the pressure can be determined explicitly, at the price of a Courant- 
Friedrichs-Lewi (CFL) condition constrained by the speed of sound. 

The random pressure can be identified by differentiating the equation of state. 
Indeed, we have an explicit evolution equation of the pressure, which can be 
expressed through Ito formulae (the equation of state f being deterministic—i.e 
the state map does not depend on the random events) as 
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af af Loy 1957 a’ f 
d,p ——-d d,T d E dur d,(o, T 
tP ap tP + aT t 72395 T P+ T 1 ( , | + 359T t(Pp, ) 
ap dp? 
lb quc. 
ot T 
(25) 


where p is the time-differentiable part of the pressure which contains, among 
other things, all covariation terms. The martingale part of d; p is dp? /t, with t a 
decorrelation time. This decorrelation time represents the typical time during which 
the random pressure acts in a coherent manner to produce a change of momentum. 
It is assumed to be the same decorrelation time than the one classically introduced 
[e.g. 12, 4] to relate in practice the definition of the variance tensor to velocity 


fluctuations variance: (i.e. a = t E (w u' Dy. The term dp? is identified from (25) 


to be the random pressure acting on the momentum equation. 
If we assume that the random pressure ensues from an isentropic process, i.e. of 
acoustic nature, we can write 


dp 
d; p = E 


dp = c^d;p, (26) 
dp 


Ss 


with c the speed of sound and s the entropy. We can then identify from (12) 
dp? = —tc’V - (po,dB,). (27) 


It can be remarked that this expression is consistent with Eq.(25) under the 
isentropic transformation assumption. 

For oceanic flows, the equation of state is often expressed in terms of density 
rather in pressure. A specific treatment adapted to oceanic flows is detailed in 
Sect. 6. 


4 Low Mach Approximation 


To perform the low Mach approximation, we follow the same steps as [11], but 
applied to the compressible stochastic Navier-Stokes equations. With our non- 
dimensioning, we have at infinity for isentropic transformations, 


ð 1 
OP) e 


=c = —. 28 
0p Is "Ww? S 


This suggests for small M the following asymptotic expansion 


Stochastic Compressible Navier-Stokes Equations Under Location Uncertainty 303 
= 2 2 
p = po + M^pi -F o(M^), 
u = uo + o(l), (29) 
1 

T= 45^ + Tj c o(l). 
and p — OGE Pr) = O(1). Similarly, the random pressure dp? follows the same 
scaling as the dine differentiable pressure. 


Collecting O(1) and O(M?) terms in the continuity equation, we obtain respec- 
tively 


V-(ujdt+odB;)=0 ; Dp, =0. (30) 


In the momentum equation (15), the order of magnitude of the covariation 
term can be determined by integrating over the domain, using distributivity of the 
divergence and performing an integration by parts: 


t t 9 1 ad o 1 9 dB 
[ «(f p(o.aB.)', | ( ( Ps 4 ES 2E 
2 k 0 o Ox. Np OX; Re 
ad 1 Ot; ;(o5dB, 
= [Xs f re dB, [C (2 Psy uc. ES 
2 OXk Re 
à m odp? 1 8r; dB 
= / d, Í V - (pa ,dB,). f Bei, p E AS T 
5 0 0 \e dx; | Re àxj 
t t 1 d o 1 . dB 
=) d, I po,dBs «n. f ( ( e Ps + OTij (0s yy) as 
8Q 0 0 N^ Ox; Re Ox; 
j tf] ddp? 1 ð; dB 
-f d; i V-(pa.dB,), | ( ( Ps + Tij (0s 2)) dx 
g 0 0 p OX; Re Ox; 
t ty 
=f d; Í drp | =F?" -g,dB, ) dx = O(M?), 
Q 0 —— Jo p 
— 


O(M?) c OG 


(31) 


where suitable boundary conditions at 942 (e.g. Dirichlet boundary conditions (no 
random inflow velocity) or zero normal stress (outflow boundary conditions)), have 
been applied to insure the first surface term vanishes. 

By neglecting the order O(M?) terms, we obtain then finally the incompressible 
Navier-Stokes presented in [15] under the incompressibility assumption 


1 1 
po Dru = —Vp dt — Vdp? + m - (rz (u)) dt + rr - (r(o;dB;)) + pg. 


V.u* 0 ; V-(o;dB;) =0. 


(32) 
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5 Boussinesq-Hydrostatic Approximation 


In this section, starting from the stochastic compressible Navier-Stokes equations, 
we perform the Boussinesq approximation by considering small density fluctua- 
tions. These fluctuations are neglected, when they are not multiplied by gravity g, 
which leads to the classical definition of the buoyancy. We perform as well the 
hydrostatic approximation through the classical aspect ratio scaling D = H/L. « 
1, with H the water depth. For simplicity, we do not consider a rotating frame. 
Coriolis correction could be straightforwardly considered as in [21]. The vertical 
coordinate z € [—H, n] is bounded by the bottom and the free surface. 


Density 


The density is decomposed through the following asymptotic expansion 


p = po + €pi(z) + epo (x, y, z, t) + o(e), (33) 


with p(z) the time-averaged stratification term, and e < 1 and we do not need to 
assume that p, > p2. We obtain hence 


V.u*—0 ; V.o;dB, 20 ; Dr (pi +2) — 0. (34) 


The drift velocity and the noise are divergence free. Density perturbations undergo 
a stochastic transport by the flow. We remark that since V - o;dB,; = 0, then the 
transport operator ID, (-) can be directly used. 


The terms of order € of Eq. (34) can be expressed in terms of buoyancy b = 
—€ gp2/po: 


dp, 1 ð 
POD b = (w* dt + (6, dB,),) LL — - V - (s. 2) dt, (35) 
g Zz 2 Oz 
with 
a— [o 2 (36) 
a, yT azz 


and a,y7 = aj, for H = (x yy. 


Thermodynamic Effects 

Equation (35) is part of the stochastic version of what is often referred to as the 
simple Boussinesq equations. In the ocean, thermodynamic effects can be important, 
and we propose to incorporate these effects by combining the buoyancy and the 
energy equation, following the steps of [22]. Assuming a linear equation of state for 
sea water, we have 
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p = po(1— Br(T — To) + Bp). G7) 


with 85 — 1/ poc? and Br = 1/ pose the coefficients of the Taylor expansion. For 
sake of simplicity, we do not take into account salinity effects, and we apply the 
stochastic transport operator to Eq. (37). We obtain 


1 
ip = —poBrDT + p tP 


1 
(o a) - -E av «o. 
c y 


(38) 


With no viscosity, divergence-free velocity and neglecting the quadratic variations 
(with the same argument as in (31)) together with the hydrostatic assumption on the 
leading term po, we can assume that the main dilatations are caused by radiative 
effects to which the potential buoyancy bg is directly sensitive: 


g pose z 
b 2-4 (s a) =o Hee 39 
o p p +t 2 st + EH, (39) 


with H, = c?/g and bs; = —€ gp1/po. Upon applying the transport operator (with 


forcing), the following evolution equation of the potential buoyancy is obtained 


D,by = r ao 4 dW3), (40) 


where dQ = dQ. + g do; V - (VT)dt, with dQ,,, the radiative heat fluxes, and 
dWy = ((u* — u) dt + o;dB;) -(CVp + pg)— (u* — u) - Vdp? 


the drift works. The drift work on the vertical velocity component can be interpreted 
(with a linear equation of state) as an alternative to the so-called eddy diffusivity 
mass flux (EDMF) scheme proposed recently for atmospheric and oceanic penetra- 
tive convection parameterization (see for instance [19, 10] and references therein). 
Indeed, in EDMF, the subgrid stress in the transport equation of temperature is 
modelled as a mass flux induced by a given number of plumes (corresponding here 
possibly to po ;dB;), multiplied by the difference of temperature between the plume 
and the ambient flow, which is here proportional to a buoyancy anomaly. Inter- 
estingly, the pressure work provides a natural non-local (horizontal and vertical) 
forcing term while the other term is a local upward/downward vertical statistical 
forcing. EDMF schemes are obtained by specifying the noise in terms of velocity 
fluctuations between the mean velocity and non-convective environment, upward 
plumes and downward plumes. Such an interpretation need to be tested with 
numerical simulations, and will be the focus of a future dedicated study. 


306 G. Tissot et al. 


By defining the buoyancy frequency 


a 2 
pı -)= 539m _ 8 (41) 


ð 

N? (z) & ( € ; 
@) az S on EH, po Oz c 

stratification and radiative effects can be introduced explicitly on the buoyancy 

equation 


l 
ib + (w* dt + (01dB1):) N? — V (a.n?) dt = Lag +awD. (42) 
0 


yp 


Momentum 

Concerning the momentum equation, we neglect here the viscous terms. In this 
framework, g is assumed to be O(1/e). We decompose as well the pressure field 
as follows 


p = po(z) tpi + pa + Of), (43) 
——— 
OG) 
where po and p; are in hydrostatic balance: 


oP 
Oz 


The momentum equation (15) becomes 
: ! 9 1 ddp? 
D,u;—- > d ast, | —[——3 
(po + <(o1 + -2)) "i » Af pos 2! TA (- Bx, ) 


€p» gài; dt. 


Opi 
—gpo and a. cen. (44) 


(45) 


Similarly as in Sect. 4, the covariation term is O(e). By neglecting Oe) terms, we 
obtain 


19 1 ddp? 
aps P2 ar PE e gS; dt. 
po OX; po dx; po (46) 
—— 
b 


Finally, p2 and the random pressure are determined through a generalisation of the 
hydrostatic balance, accounting for a part of non-hydrostatic effects by balancing 
in the vertical momentum equation the vertical pressure gradient with buoyancy, 
stochastic diffusion, corrective drift and stochastic advection of w. We consider 
a regime where the hydrostatic approximation in the deterministic framework is 
only roughly valid (in other words at the limit of validity), such that a noise with a 
strong amplitude can break this assumption—or changing viewpoint, the regime is 
intermediate and we aim at modelling some weak non-hydrostatic effects through 
stochastic modelling. By scaling analysis (weak aspect ratio and noise with strong 
amplitude), d;w and (u - V) w are neglected while terms associated with the noise 
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are kept. Indeed, denoting L? the scale amplitude of o;dB;, and t the decorrelation 
time, the advection of o;dB, cannot be neglected if! L° [Lye ^ 1/(Fr D)? 
and stochastic diffusion and drift velocity are important if (L7 / Lyc [Trt ~ 
1/(Fr Dy?, with the Froude number Fr = wu/(N H). Since dw is neglected, 
martingale and time-differentiable terms can then be safely separated, such that the 
remaining pressure term can be determined by vertical integration through a scheme 
similar to the one applied in the classical hydrostatic regime: 


n 1 1 
p= f ((5v a v) 5v avo - 0) dz 


" (47) 
dp? = -o | (c;dB, - V) w dz. 


Here, we have neglected d;w, but random vertical transport could generate 
some random vertical acceleration instead of only random pressure fluctuations. 
An intermediary assumption could be to consider that the time-differentiable part of 
d; w is negligible (classical hydrostatic balance), but that its martingale part is not. 
It could be obtained by diagnosing the vertical velocity time increment and thus 
bringing and additional correction to dp? in Eq. (47). 


Summary 
By collecting the Eqs. (42), (46), and (47), we obtain the following stochastic 
Boussinesq system with thermodynamic forcing 


V-*(o;dB;) = 0 


n 1 1 
po =o f ((5v a v) e 5v avo -») dz 


n 
dp; = -» | (o;dB, - V) wdz 
z 


1 
Dib + (w* dt + (6 4B,).) N? - 5V (asn?) ar = ÉÉ* ao + awa) 


Ypo 


1 
dO = dQ,, + —V - (VT) dt 
Q=d0u+t pon, T) 


1 1 
dW, = (sv -adt — 2 (Vp2 + pobez) + 5V «a Vdp? . 
(48) 


! Tf the frame rotation is taken into account, the ratio R,/Bu Rossby over Burger is additionally 
involved, but does not change the existence of an intermediary regime. 
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In system (48), the pressure is obtained through a relaxed hydrostatic balance, and 
the vertical velocity is deduced kinematically from the divergence-free condition 
of the drift velocity. Neglecting the thermodynamic effects, together with a strong 
hydrostatic balance assumption (weak to moderate noise regime), we recover 
the simple Boussinesq system presented in [16], without the Coriolis correction. 
Obviously, this latter could be added without any major difficulty. 


In some applications, a more accurate evaluation of the buoyancy is required, 
and it can be obtained through an equation of state o&, (T, p) (salinity is not taken 
into account here and left for future works) associated with a transport equation of 
temperature (and salinity when considered). Under the aformentioned assumptions, 
the transport equation of temperature (23) is simplified, and the full system can be 
written 


18 1 ddp? 
)iuj = pa Pı for i= {u,v} 
po OX; po Oxi 
* füu*  Qv* 
= —(V-a)- d 
uw cil B z 
V -(0;dB,;) = 0 


" 1 1 
p= eo f (5v a v) w+ 5v vo -») dz 


á (49) 
dp? = - | (c;dB, - V) wdz 
Z 


1 1 
PDT = (5v -adt — o,4B, ) * (V p2 + pobe;) + 2 -a- Vdp? 
y 


1 
—— -V .(VT) dt + dQ, 
+ Bopp VT) dt dO, 


g 
b = —— pso(T, — p082). 
po 


Usually, a simple stochastic advection-diffusion equation is considered for the 
transport of temperature, but in the system (49), we can point out that the drift works 
remain. These source/sink terms in the temperature evolution equation is one of the 
principal outcome of this study. As outlined this additional terms for parameterising 
discrepancies to hydrostatic physics and primitive equations. In the next section, we 
explore systems at finer resolution. 
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6 Extension to Non-Boussinesq 


The aim of this section is to propose a formulation to relax the Boussinesq 
assumption in the LU stochastic framework while avoiding the resolution of a 3D 
Poisson equation. We consider now an intermediate model between the fully non- 
Boussinesq non-hydrostatic formulation and the system (49). For sea water, the 
equation of state is formulated in terms of pg9(T, p) instead of p(p, T) as in gas 
dynamics. To take this aspect into consideration, we follow [3] in order to obtain 
an explicit expression of the pressure. This is at the cost of resolving in time sound 
waves, or a pseudo-compressibility information propagating at the velocity c. The 
density is decomposed as 


0 
p = pu(T, p) + ap? +0 (6p), (50) 


—— 
5p 


with o&,(T, —pogz) the Boussinesq density determined by the equation of state 
under an hydrostatic balance condition. The deviation to this density is then assumed 
to be ensue from an isentropic transformation, i.e. of acoustic nature. The term 
9p _ı 


acum is then directly related to the sound speed (or more precisely to the fastest 


wave considered in the model). We determine now a transport equation for óp. 


Continuity 
We start from the continuity equation of the stochastic compressible Navier-Stokes 
equations 


dp 4 V- ((« = v .a) dt + dB) P) = jV - (aV p) dt. (51) 


A transport equation for óp can be deduced as 


1 
d; (60) = — dr (Ps9) — V - ((« uii zB) (so + ao) 
(52) 


1 
+ 2M - (aV pao + 6p) dt. 


At the scales considered here we assume that the unresolved contribution is of 
hydrodynamic nature associated to a divergence free noise V - (o;dB;) = 0. 


Momentum 
The pressure can be decomposed as well as 


n 
P = Pan + f Poo(z)g dz/ + Pru + c?8p, (53) 


zZ 
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where Pam is the atmospheric pressure, which will be neglected later for simplicity. 
The second term is the hydrostatic pressure associated with the Boussinesq density. 
The third term, py, is associated with Boussinesq non-hydrostatic effects balancing 
vertical advection, and finally c?6p corresponds to a non-Boussinesq component of 
acoustic nature. 


For the martingale random pressure, two components are considered: a Boussi- 
nesq non-hydrostatic term as in Eq. (48), and a non-Boussinesq component of 
acoustic nature as in Eq. (27) 


] 
dp? = - | (c;dB, - V) wdz' — tc? (o;dB, - V) p. (54) 
z 
Neglecting the viscous terms, we obtain for the momentum equation 
i . ft a (1 adp? ap adp? 
D,uj — » d «dB. | £j dt L + pg. 
p Drui 2 Af p(osdBs), fa (- P ) pm E 


(55) 


Assuming that o Djui ^: Peso Drui, and following the same arguments as in Sect. 4 
to neglect the quadratic variation term, one finally get 


Op ddp 
Peo Vtu = ET dt — de 
Op ; ddp, 
v= —— dt — 
Peo Vt zm ay 
ð 0dp? 
Pro Drw = ES dt — a + (Psa + óp)g dt 
əz Oz 
A / 1 1 1 2 
p= (Pao(Z ) + 60)8 + Pro gy Rd wa GV (av) dz + côp 
FA 


n 
dp? = - | (c;dB; - V) wdz' — tc? (o;dB, - V) p. 
g 


(56) 
The system (52)-(56) can be solved explicitly and does not require the expensive 
resolution of a 3D Poisson equation. Although system (49) proposes a deviation to 
the hydrostatic hypothesis through the martingale random pressure, the system (56) 
considers a non-hydrostatic model that fully accounts for stochastic vertical accel- 
erations while relaxing the effect of fast waves truncation through the martingale 
pressure term. This system remains restricted by a CFL condition depending on 
the propagation speed of pseudo-compressibility informations. We believe this 
modelling strategy opens some new research directions on the role of unresolved 
small scales on non-hydrostatic and non-Boussinesq effects in oceanic flows. 
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7 Conclusion 


This paper proposes a stochastic representation under location uncertainty of the 
compressible Navier-Stokes equations. It as been obtained from conservation 
of density, momentum and total energy, undergoing a stochastic transport. The 
structure of equations remains similar to the compressible deterministic case. 
Nevertheless, because of the specificities related to stochastic transport, we have 
identified additional terms such as work induced by the alignment between the time- 
differentiable pressure gradient and the drift velocity. This small scale induced work 
is alike the baropycnal work known in compressible large eddy simulations and 
includes also terms reminiscent to mass flux parameterisation of atmospheric and 
oceanic penetrative convection phenomenon. These terms are obtained by the mean 
of a rigorous derivation from the conservation laws coupled with stochastic calculus 
rules associated to stochastic transport, instead of phenomenological arguments. 

We have verified that applying low-Mach and Boussinesq approximations on 
the stochastic compressible system enabled us to recover the known incompressible 
and simple Boussinesq stochastic systems respectively. The general set of stochastic 
compressible equations allowed us to incorporate thermodynamic effects on the 
Boussinesq system. Finally, this formulation has lead us to propose a way to 
relax the Boussinesq and hydrostatic assumptions. This study opens some new 
research directions to exploit the potential of stochastic modelling for the numerical 
simulations of oceanic flows we will exploit in future works. 
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Appendix A: Stochastic Reynolds Transport Theorem from 
Stratonovich to Ito 


The aim of this section is to rewrite the stochastic Reynolds transport theorem 
with a Stratonovich convention, and verify that we get the same relation as the 
ones obtained in Sect.2 in the Ito setting. To that end, we follow the steps of 
[16, Appendix D] but we use the Stratonovich convention. Then, we pass from 
the Stratonovich to the Ito form. These forms are equivalent for regular enough 
processes, thus allowing us to verify the consistency of the Eq. (9). 

To pass from Ito to Stratonovich, we use the following relation 


1 t t 
X,odY, = X,dY, + 5d; (| ax, f a). (57) 
0 0 


We define the characteristic function $ (x, f) transported by the flow, such that 
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$(X;(xo)) = g(xo), (58) 


with a compact spatial support V(t) of non-zero values that does not include points 
on the domain boundary. We can then write 


d / Goe t) dx —d f (qd), ) dx 
V(t) Q (59) 


a dioq à + q dio dx. 
Q 


Since $ is transported, we have 


d$ (X,, t) = dod+Vod dX, = 0 


1 1 (60) 
dog + (u — a poy canis VO (V6 - o,)odB, = 0. 


We have then 
d Jj (qd)(x, r) dx 
V(t) 


1 1 
= f s0g0-4(w- 2/79 jn +01) -V0 (6o )odB,) dx 


=f [aroa +V. (« (v = ly eat luy * 0,)) dt +o,0aB,))| $ dx. 
R 2 2 
(61) 


We add now a force and obtain the SRTT in Stratonovich form: 


1 1 
doq +V è (« (o — -V rat sar *0;)) draoa, )) = Qi dt+ Q,0dB;. 


2 
(62) 
Let us now write this in Ito form: 
ð j 0 ; 
au (ac..ijodB]) BT (ac..i;dB1) 

1 t à j ; (63) 

-d d; | — T dBi). 

+ 2 Af t (s. (42.,))) 2i j 

—————M—— 


Since o; is time differentiable in the Eulerian grid, we have 


t t ; 
Zu dios. | api) - o 
0 0 
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Then, 


ME 
(f (E m tm) [a 


X 
0 A 0 oq 0 90;,1j 
zr (oloni) dt — au (Zoos) dt — Ox: (esu) dt 


=V - (o; Q,) dt — V - (aVq) dt — V - (qos (V -a,)) dt. 
(64) 


In addition, we make the hypothesis that dQ, is time-differentiable in the 
Lagrangian frame, such that we have 


A . 5 1 : 
af ghar = f d; QJ. + V - (Q7 (u* dt + o;dB;)) + =V - (aV Q2) dt dx 
V(t) Y) 2 


zi F dt dx. 
V(t) 


We can then write 


1 t . t 1 
Q, odB, dx =f Q,dB; + =d; (| d, o. [ api) dx 
va) vo) 2 0 0 


1 , j Bpi 
— Q, dB, = —-d; (| V * (Qo. aB.). | api) dx 
vo) 2 0 0 


1 ta rss per 
vit) 0 OX; 0 


1 ə ; 
= dB, — -— (Q! jj) dt d. 
on Qo t 2 üx; (Q50:, ji) x 


(65) 


1 
= Q, dB, — , V (e: Q,) dt dx. 

V(t) 
(66) 


Assembling everything and dropping the space integral, we obtain 
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1 1 
dioq +V. (a (o — ay “at gril tear ouod.) 


= Qi dt = Q,0dB, 


1 1 
1 
T5 [V - (c; Q,) dr — V - (aVq) dr — V - (qa; (V -o;)) dt] 


1 
= Qi dt — OQ, dB, + ay * (0; Q,) dt. 


We note that d;oq = d;q since 1d, J ds, IA a) = 0. After simplification, we obtain 


dq +V- (« (u — jV -a) ar +0/4B; ) ) +V-+(0;Q,) dt 
(68) 


1 
= 53V: (aVq) dt + Or dr + Q,dB,, 


which is exactly Eq. (9). 

To obtain Eq. (68), we had to assume that dQ, is time-differentiable in the 
Lagrangian frame, which renders the demonstration slightly more restrictive con- 
cerning the shape of the forces. We do not have to perform such an assumption in 
Ito form, and we consider the present appendix as a sanity check of Eq. (9). 


Appendix B: Calculation Rules 


Distributivity of the Stochastic Transport Operator 


The distributivity of the stochastic transport operator is detailed in this section, in 
the case where the stochastic transport operator is balanced by random RHS. If the 
evolution of two variables f and g are given by 


(f — F,dt+ Fo - dB, 
t8 = G; dt + Gs -dB;, 


(69) 


we have then 


1 Cfg) — f Digts D; f+F o+Go dt-(o, (F5) - V) gdtor(Go)-V) f dt, — (70) 


or less formally: 
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D,(fg) =f Dig + 8 D: f 


t t 
T d; (/ F, an, | Gs aB.) 
0 0 
t t (71) 
-«(f Fo -dB.. | (s.d, -V)«) 
0 0 


t t 
— d, (| Go -aB., f (osdBs - V) fh 
0 0 


This relation is useful to transform the conservative form of the Navier-Stokes 
equations into non-conservative form. 


Proof 
D,(fg) = 
1 
d,(fg) + (u* - V) (fg) dt + (0 dB; - V) (fg) — zV AVe) dr 


= fdg + gd: f +d: (f, 8) 


+ f (n V) gdi +g (u* V) fdt + 5v : (avg + gaV f) 


= fdg + gd; f 
t 


t 
+a f = (e.4B, V) f + Fo -dB.. | 
0 0 


+ f (u* - V) gdt + g (u* - V) f dt 


— (o5dB; - V) g + Go «4n. 


1 
- (fY - @V8) + aV) -V) f + eV - (aV f) + (aV f) - V) g) ar. 
(72) 
Developing only the covariation term: 


t 


t 
& (f - osa. V) f+ Fo ans f 
0 0 


((aV f) - V) g dt 


t t 
«(f Fo dB, | Ga -aB. 
0 0 
t t 
-a{ f F, -dB,, Í (o.aB,-¥) 8] 
0 0 
t t 
-«(f Go -aB.. | (s.dB, V) f). 
0 0 


— (o,dB, - V) g + Gc * ap.) = 


(73) 
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whose right hand side is written formally 
(aV f - V) gdt + Fo + Go dt — (0;(Fo)+V) gdt — (o,(Go) -V) f dt. (14) 


Substituting (74) into (72), we obtain Eq. (70). 


Work of Random Forces 


For sake of clarity, we first detail calculation rules in the context of point mechanics 
to define the work of random forces. We consider a random force, whose impulse is a 
martingale dF. We write its elementary work in a weak sense for any differentiable 
function $ (t) such that $ (0) = $ (T) = 0: 


T T a t 
f $ (t)dW, =f p(t) èf aro ) dX. (75) 
0 0 ot 0 


The random force 2- [^ dF, is written here formally, since dF, is a martingale 
and cannot be differentiated in time. The work in Eq.(75) can be split in two 
contributions: dW,.,, associated with the displacement u dt, and dW, 4 associated 
with the displacement o; dB;. We treat them separately. 


T T a t 
f o(t)dWou zi (t) (=f ar.) -u(x, t) dt. 
0 0 ðt Jo 
T / pt 
E: -f (/ ar, ) - (6 Oulx, t)) dt. 
0 0 


The last expression is well defined and is a proper way to write this term. We can 
remark that Ju dF, is homogeneous to a Brownian. If we expand the following 
expression for a time-differentiable function v (x, t) 


T t T t T 
1 a(vo.n- f ar, ) a (/ ar, aveo f W(x, dF, 
0 0 0 0 0 
T t T 
=} (J ar, wena f y, t)dF,, 
0 0 0 


(77) 


(76) 


by taking w(x, t) = $(t)u(x, t), we obtain 
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T T T t 
i $(1)dWo y= I é(DuG., DdF, — f a (oto. f ar) 
0 0 0 0 


T 
= 1 $(t)u(x, t)dFo — $(T)u(x, T) - Fo(x, T) (78) 
0 


T 
= f o (t)u(x, t)dFo. 
0 


We can identify 
dWs,, = u(x, t)dF,. (79) 


The second term dW,., is not well defined, even in weak form. Informally, it 
should balance with kinetic energy of o;dB;, which is not well defined (possibly 
infinite), and which has not been considered in the definition of total energy. 
Discarding this term is consistent with the derivation of the momentum in [15], 
where the acceleration associated with o;dB, being highly irregular is assumed 
to be in balance with some forces components which are equally irregular. As a 
consequence in our model, there is no work of the random forces associated with 
the Brownian motion displacement of the control surface. 


Appendix C: Displacement of a Transported Control Surface 


Let us apply the SRTT to a characteristic function (q = 1 in 2 (t), q = 0 outside) 
transported by the flow [16], and use the divergence theorem. We obtain the volume 
variation associated with a control surface transported by the stochastic flow. 


avo =a f idx f V - (u* dt + o,dB,) dx 
Q(t) R(t) 


(80) 


ii (u* dt + o;dB;) nds. 
62(t) ———————— 
Xa, 


Hence, the normal displacement of the control surface is dX g_;+n, which involves the 
modified advection velocity. As a consequence, the modified advection velocity has 
to be considered for the definitions of elementary works based on surface integrals. 
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Data Driven Stochastic Primitive A 
Equations with Dynamic Modes or | 
Decomposition 


Francesco L. Tucciarone, Etienne Mémin, and Long Li 


Abstract As planetary flows are characterised by interaction of phenomenons in a 
huge range of scales, it is unaffordable today to resolve numerically the complete 
ocean dynamics. In this work, a stochastic version of primitive equations are 
implemented into the NEMO community ocean model to assess the capability of 
the so-called Location Uncertainty framework in representing the small scales of 
the ocean flows. 


Keywords Stochastic parametrization - Ocean modelling 


1 Introduction 


Numerical resolution of planetary flows is nowadays a key tool to investigate 
possible future climates. The Ocean is a key actor of climate regulation and its 
evolution is for that reason of major interest. High resolution simulations are 
however extremely expensive and their usage remains limited to small domains. 
Large-scale simulations remain the primary tool to investigate future states of 
the Ocean (and of the Atmosphere as well). These simulations however do not 
resolve the complex interdependence of mesoscale and sub-mesoscale dynamics 
that characterises the global circulation, and thus great care must be put in the choice 
of the parametrization of all the scales that are too small to be efficiently resolved. 
Recent approaches incorporate noise terms to the dynamics of the flow with the goal 
of modelling the unresolved (and parametrised) processes, including small-scale 
turbulence, boundary value and scale coarsening uncertainty, as well as discretiza- 
tion and numerical errors. Rigorously justified methodologies have been introduced 
by Mémin [1] and Holm [2], providing a theoretically consistent stochastic large 
scale representation of the Navier-Stokes equations [3] conserving either energy 


F. L. Tucciarone (54) - E. Mémin - L. Li 
INRIA Centre de l’ Université de Rennes, UMR CNRS 6625, Rennes, France 
e-mail: francesco.tucciarone @inria.fr 


© The Author(s) 2024 321 
B. Chapron et al. (eds.), Stochastic Transport in Upper Ocean Dynamics II, 
Mathematics of Planet Earth 11, https://doi.org/10.1007/978-3-031-40094-0 15 


322 F. L. Tucciarone et al. 


or circulation, respectively. Both models rely on a stochastic decomposition of the 
Lagrangian trajectory into a smooth-in-time component induced by the large scale 
velocity and a random fast-evolving uncorrelated part. From such models, a large- 
scale representation with a stochastic definition of the small-scale effect emerges 
naturally. Moreover, compared to classical large-scale deterministic modelling, the 
additional degree of freedom brought by the stochastic component allows us to 
devise new intermediate models [4, 5, 6, 7, 8]. The Location Uncertainty (LU) 
approach [1] has been tested within the barotropic quasi-geostrophic model, the 
rotating shallow water model and the surface quasi-geostrophic model, where it has 
proven to be more accurate in structuring the large-scale flow [4], reproducing long- 
term statistics [9] and providing a good trade-off between model error representation 
and ensemble spread [10, 11] This work investigates the benefits of such model in 
the hydrostatic primitive equations, following the work of [12] in which noise based 
on Empirical Orthogonal Functions (EOF) was proposed. Here, a more elaborate 
noise defined from a Dynamics Mode Decomposition (DMD) strategy is proposed. 


2 Location Uncertainty (LU) 


The Location Uncertainty principle consists in describing the trajectory X; of a fluid 
particle with a stochastic decomposition of the Lagrangian trajectory, represented 
with the following stochastic differential equation (SDE): 


dX, = v (X;, t) dt + o (X;, t) dB;, (1) 


where X: 2 x IR* — Q@ is the fluid flow map, i.e. the trajectory followed by fluid 
particles starting at initial map X|,-9 = xo of the bounded domain 2 C IR?. The 
first component, v (X;, t), acts as the smooth-in-time component of the (Lagrangian) 
velocity of the flow, which is correlated both in space and time and associated with 
the integration of the equations of motion. The second component, o (X;, t) dB;, 
is a stochastic contribution (referred to as noise) that accounts for the processes 
that cannot be resolved at a given resolution or that have been neglected through 
a given numerical or physical modelling approximation. To completely define this 
last component, let H be the Hilbert space, H — (L? (S), R^), the space of square 
integrable functions over S with value in IR?, with the inner product (f, g)j = 
Is(f'g) dx and induced norm || fllg = (f. f)H, and let T be a finite time, 
T < +00. In this framework {B;}o<;<r is defined as an H —valued (cylindrical) 
Brownian motion [13]: 


B, = Y fe Q) 


ieN 
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where (e;);enN is a Hilbertian orthonormal basis of H and (Bien is a 
sequence of independent standard Brownian motions on a stochastic basis 
(2, F, (-1)repo, rj, IP). The noise is then properly defined as the application of 
an Hilbert-Schmidt symmetric integral kernel o; f (x) = f. S ò (X, y, t) f (y) dy to 
the H —valued cylindrical Wiener process B as 


(c;dB;) (x) = T Sik (x, y, t) dB. (y) dy, (3) 


where the Einstein summation notation is adopted. The role of the integrable kernel 
o is to impose a fast/small scale spatial correlation. It leads to the covariance 
tensor Q 


Qij (y. t, 5) = E [ (dB, (x))' (4B, (9) | 
—(t—s) ar f Oik (X, Z, t) On; (Z, y, S) dz, 
Ss 


of the centred Gaussian process ø ;dB; ~ V (0, Q). The diagonal components of the 
covariance tensor per unit of time, defined as a(x, £)8 (t — t’)dt = Q(x, x, t, t^), are 
referred to as the variance tensor. This tensor provides a measure of the strength of 
the noise. Notably, the variance tensor has the dimension of a viscosity in m?s~! and 
is symmetric and positive definite. The operator a is a compact auto-adjoint positive 
definite operator on H, that admits hence eigenfunctions &,, (*, t) with (strictly) 
positive eigenvalues A, (f) satisfying J ^, cy An (f) < +00. As a consequence, 
the noise and the variance tensor a can be expressed (with another sequence of 
independent standard Brownian motions) through the spectral representation 


o dB; (x) = D> AN? O & (X, t) dB, (4) 
nelN 

axt) = DAME, @& NED). (5) 
nelN 


This noise term is centred, however a modification can be applied in order to 
consider a Lagrangian displacement of the form 


dX, = [v (X, t) — o; Y, (x)] dt + 0 dB, (X;). (6) 


In contrast with (1), this decomposition sees the contribution of a centred Wiener 
process B; that is drifted by a correlated component o; Y, (x). A proof of this 
statement can be given with Girsanov theorem, as a new probability measure P can 
be built in such a way that a non centred Wiener process as 
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t 
B, =B, + | Y, ds, (7) 
0 


where {Y t} is a random process shifting the process B,, remains centred on 
(2, .F, P, {F:}o<:<r}. The new definition of the noise reads then 


o dB; (x) = o;dB, (x) — oY; (x) dt, (8) 


with {By}; a P—Wiener process. All the arguments provided in the following will 
hold for this process under P, but the usage of a drifted noise is of paramount 
importance when the phenomenon to be modelled displays a non-zero time average 
and the physical processes involved cannot be regarded as completely uncorrelated, 
like in the case of ocean eddies and gyres. In the following, Eq. (6) will define our 
Lagrangian trajectory and the tilde notation will be dropped for simplicity. 


3 Stochastic Boussinesq Equations 


Within the Location Uncertainty formalism the evolution of a random tracer q 
transported along the stochastic flow is described by the stochastic Reynolds 
transport theorem, introduced in [1]. The rate of change of a scalar q, integrated 
within the volume V;, is described by 


af q (x, t) dx = f [D;q + qV - [v* dt + odB;]} (x, t) dx, (9) 
V, V, 
and summarised by the operator 
» 1 
Dig =diq + [v dt +o; dB; ] - Vq — aV EY dt. (10) 


In this formula, the first component of the right-hand side is the increment in time 
at a fixed location of the process q, that is drq = q (X;, t + dt) — q (Xi, t), playing 
the role of a derivative in time for a non differentiable process. Encased in the 
square brackets there is the stochastic advection displacement, composed of a time 
correlated modified advection v* and a fast evolving, time uncorrelated noise ø; dB;, 
both advecting the scalar g. Under the probability measure P the velocity v* is 
defined as 


1 
WY mo Vinto, (Vey) ait, (11) 


where v is the resolved component of the velocity, Vs = IV - a is the effective 
transport velocity resulting from the noise inhomogeneities and the last term is 
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the Girsanov drift due to a non centred noise. With this operator it is possible to 
formulate the Boussinesq equations under location uncertainty as done in [12] and 
reported below, split into horizontal and vertical equations using the convention 
v = (u, w) and with the buoyancy defined as b = — o 


Horizontalmomentum : 


1 
D;u+ fes x (war joan) =V, (—p'+5V v) dr-V,dp? — (12) 


Verticalmomentum : 

Dw = = (7p e zv. v) dr — apf + bai (13) 
Temperatureandsalinity : 

D,T = kr AT dt, (14) 
D, S = ks AS dt, (15) 
Incompressibility : 

V.[|v-v]|20,  V-.o;dB|i2O0, | V-.o,Y,—O (16) 
Equationofstate : 

b —b(T,S,z). (17) 


In this formulation Temperature T and Salinity S are introduced as active tracers 
and transported along the stochastic flow, thus impacting the stochastic transport 
of momentum through the equation of state. The term dp? in Eqs. (12) and (13) 
is a martingale term representing (under the measure P) a zero-mean turbulent 
pressure related to the noise, termed stochastic pressure. From this starting point, 
Primitive equations can be achieved through a hydrostatic hypothesis on the vertical 
acceleration equation, resulting in the two conditions 


=0, (18) 


the first one being the usual hydrostatic balance, the second being the result of 
the uniqueness of the semi-martingale decomposition. In this work and within the 
scaling used in [12], the stochastic pressure is constant along depth and is supposed 
to be in a pure geostrophic balance [10, 14], thus not impacting the rate of change 
of momentum. In this hydrostatic setting the vertical component of momentum is 
a diagnostic variable computed through integration of the incompressibility condi- 
tion (16), and the large scale pressure p’ is obtained through vertical integration of 
the buoyancy term. 
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4 Methods 


The proposed method is implemented in the level-coordinate free-surface primitive 
equation model NEMO [15] in a double-gyre configuration consisting of a 45° 
degrees rotated beta plane centred at ~30°N, 3180km long, 2120km wide and 
4km deep, bounded by vertical walls and with a flat bottom. Seasonal winds and 
buoyancy changes are imposed as external forcings to induce the creation of a 
strong jet that separates a cold sub-polar gyre from a warm sub-tropical gyre. The 
complete details of this configuration are given in [16, 17], and the parameters of 
the simulation were chosen accordingly to the reference papers. The only change is 
in the values of eddy viscosity and diffusivity, enhanced of a factor five to suppress 
aliasing in the salinity field observed for smaller values (see Table | for an overview 
of their values). In order to assess the benefits brought by this stochastic approach, 
two purely deterministic simulations at different resolutions, 1/27° (R27d) and 1/3° 
(R3d), are compared to a stochastic simulation at 1/3° (R3LU). Each simulation 
consists of 10 years of data, collected every 5 days and averaged over the 5 days. 
The R27d simulation has been spun-up for 100 years before collecting data for the 
LU framework. Similarly, an additional 1/9° deterministic simulation has been spun 
up for 100 years in similar conditions in order to construct an initial state for the 
deterministic and stochastic 1/3° simulations. 


4.1 High Resolution Data Filtering 


The high resolution data used to force the low resolution stochastic model need to be 
filtered before being used, in order to avoid the injection of energy scales that can 
jeopardise the stability of the simulation. The low resolution velocity fluctuations 
are obtained through spatial filtering of high resolution temporal fluctuations. First, 
a time average is defined on the high resolution fields as 


1 
i’. = z Í uj, (X, t) dt, (19) 


Table 1 Parameters of the model experiments 


R27d R3d R3LU 
Horizontal resolution 1/27 (3.9 km) 1/3? (35.3 km) 1/3? (35.3 km) 
Horizontal grid points 540x810 60x90 60x90 
Vertical levels 30 30 30 
Time step 5 min 20 min 20 min 
Eddy viscosity —5x107? ms! —5x10-? m*s! | —5x107!? m*s^! 
Eddy diffusivity —5x10-10 m^s-! | 1500 n?s^! 1500 m?s-! 
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so to obtain with Reynolds decomposition the high resolution fluctuations: 
uj, D = ug (x, 0) — ur, (x). (20) 


The corresponding low resolution fluctuations are obtained through a band-pass 
filter as ua = (61 — Go) as where G represents a Gaussian filter. These filtered 


fields have a smaller amount of energy compared to the original snapshots, and they 
are re-scaled to this amount as 


Is], 
in = Gi-Gr) Uy) > (21) 
Um [e-u] | [01-9 ue] 


/ 
Ui 


where the downward arrow represents downscaling towards low resolution. The 
result of this procedure sees the velocity fluctuations have the same spatial structure 
as before but enhanced level of energy. 


4.2 Off-Line Noise Modelling Through DMD 


Dynamical Mode Decomposition is a methodology [19] to construct a proxy linear 
dynamical system to describe an unknown non-linear dynamics. In this paper 
DMD is applied to the evolution in time of the velocity fluctuations, that is thus 
approximated as 


u (x, fj41) ~ Au’ (x, ti). (22) 


Such (finite dimensional) linear dynamical system is known to have a general 
solution: 


N 
w (x,t) = > bm exp (Umt) Om (X) (23) 


m=1 


where $,, (x) € C are the eigenvectors of A associated to the eigenvalues um € C 
and bm € C are amplitudes. In particular pm = Om + icy, the real part om is the 
growth rate of the mode and o, is the periodic frequency of the mode m. Since the 
initial data are real valued fields, the eigenvectors, eigenvalues and amplitudes will 
be two-by-two complex conjugate, that is $5, = $»,,,. Following the successful 
proposition of [18], we split the DMD modes into correlated and uncorrelated modes 
in order to define the Girsanov drift through the slow component of the dynamics 
and the random noise through the fast component. The two sets of modes, M" for 
the uncorrelated noise and M“ for the correlated part are defined as 
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Fig.1 Illustration of the selection of the DMD modes. On the left, frequencies of the modes are 
plotted on the unitary circle; they are coloured differently to represent their characteristic physical 
time scale. At this point, a threshold te = 25d is chosen to differentiate the correlated from the 
uncorrelated modes. On the right, over violet background are plotted the correlated modes, over 
orange background the uncorrelated modes. The amplitude threshold for the correlated mode C^ 
is set to zero, while for uncorrelated modes C" is set to 2. The grey dots represent the set of 
uncorrelated modes below this threshold, that are thus discarded 


Cc 


T 
M" = In eu. t [Amd ~ 1, [oml > Lc [bml so (24) 


T 
Me - nen | [Uml ~ 1, |om| € — ml > c. (25) 


c 


where Te is a temporal separation scale between correlated and uncorrelated 
(usually set to a value for which a spectral gap is observed and fixed here 
to twentyfive days) and C", C* are empirical cut-off of amplitudes. A visual 
representation of the aforementioned procedure is given in Fig. 1l. As the DMD 
modes are not orthogonal, a scaling is applied to avoid spurious effects and to make 
sure that the reconstructed data corresponds to an orthogonal projection onto the 
subspaces spanned by the set of modes contained in M” and M°. The procedure 
reads as follow: 


— Construct the Gramian matrix G of components 8m,n = (Ọm, Øn) 7, With m,n € 
M; 

— Invert the Gramian matrix and compute the dual set of modes 6* = G7!®; 

— define the amplitudes as the initial state data on the dual set of modes: g,, = 


(u (x, 10), On) um- 
Such procedure is applied for M = M" and M = M¢ separately. 
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4.3 On-Line Noise Reconstruction 


Inside the NEMO core, during the simulation, the noise and Girsanov drift are 
defined as: 


c,9dB, = ty Y. exp Gont) Om (X) dBm, (26) 
meM*# 
oY, =U'+ M exp Gont) Pm (X), (27) 
me. Mc 


with Em = J/t09,,. In Eqs. (26) and (27) tg is the process decorrelation time. It is 
supposed to be different for each component evolving in the system, as momentum, 
temperature and salinity do not diffuse with the same decorrelation time. The 
subscript 0 can thus indicate momentum u, temperature T or salinity S and the 
corresponding noise and variance will be indicated as o; odB; (x) and ag (x, t). The 
different decorrelation times are difficult to characterize precisely (as they depends 
in space), but their ratio can be justified by physical reasoning. The decorrelation 
times are chosen in such a way that ty = Af, tr = « At and ts = SAt, where 
At is the simulation time step and « the molecular diffusion coefficients. Each 
(eigen) frequency w comes in pairs and each pair of complex Brownian motion are 
conjugates. The real and imaginary parts of the Brownian motion are independent. 
As such, both the noise and Girsanov drift are real-valued fields. The variance tensor 
of such noise remains stationary: 


aj (X) =) DY) 9,009, 09). Q8) 
meM*" 


After construction with the offline data through Eqs.(26) and (27), the noise 
o;,90B; is constrained to live on the tangent space of the isopycnal surfaces. This 
procedure is operationally implemented as the application of an isopycnal projection 
operator P? 


Vp (Vy 


pe =] 
Vo]? 


(29) 


to the noise. Being the density function of temperature and salinity, o = p (T, S, z), 
the isopycnal projection operator carries information about the current state of the 
simulation. The projected noise o,dB? (x) = P^ c;dB, (x) is thus strongly tied to 
the evolution of the flow density. 
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5 Results 


In this work we analyse the result of a single realisation. Qualitatively speaking, 
a coarser resolution simulation is far from being as representative as the fine 
resolution simulation, as can be seen in both Figs. 2 and 3, where the leftmost panel 
contains results from the R27 deterministic simulation, the centre panel contains 
the R3 deterministic simulation and the rightmost panel contains the R3 stochastic 
simulation. The effect of the forcing is manifested in the R27 simulation in a jet 
current roughly aligned with latitude (tilted by a 45 degrees angle in the domain 
frame). The presence of this structure and of a secondary and smaller jet stream, is 


visible in the reference papers [16, 17]. This feature is absent in the R3 deterministic 
simulation. Figure 2 depicts the average relative vorticity ¢'° = (8 vU — Ayu) /f = 


and shows primarily the difference between a high resolution simulation and a 
coarse resolution simulation. In the centre panel, representing the deterministic low 
resolution simulation R3d, there is no sign of the characteristic jet that can be seen 
in the left panel as a strong contraposition of opposite sign vorticity. Viscosity on 
the boundary creates a sequence of alternating bands of opposite vorticity, related 
to jet separation problem in low resolution simulations [20]. The stochastic R3LU 
simulation conversely presents a much better representation of the dynamics, as a 
jet can now be identified clearly. The time averaged sea surface height, represented 
in Fig.3 shows for the high resolution simulation the characteristic geostrophic 
properties of the jet stream: the northern, cold sub-polar gyre is characterised by 
a smaller height than the southern, warm sub-tropical gyre. This characteristic is 
not visible in the deterministic low resolution simulation R3d, where one can only 
find the effects of the boundary, while the stochastic R3LU simulation presents 


R27d R3d R3LU 
T - 


0.05 0.1 


0.0 


-0.05 


-0.1 


Fig. 2 10-years averaged relative vorticity ¢ = (8xv — yu) /f at the surface layer of the model 
for deterministic high-resolution (1/27°, left), for deterministic low resolution (1/3°, middle) and 
for stochastic low resolution (1/3°, right) 
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Fig. 3 10-years averaged sea surface height for deterministic high-resolution (1/27°, left), for 
deterministic low resolution (1/3°, middle) and for stochastic low resolution (1/3°, right) 
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Fig. 4 10-years averaged kinetic energy for deterministic high-resolution (1/27°, left), for deter- 
ministic low resolution (1/3°, middle) and for stochastic low resolution (1/3, right) 
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a much more faded picture of this process, which is not as intense as in the 
R27d but definitely present. Viscosity on the left boundary provides a strong 
constraint to the dynamics, biasing the representation of the jet stream in both low- 


resolution cases. Figure 4 shows the ten years average of kinetic energy, KE" = 


(u? + v2) /2 From this picture is clear that, while the stochastic simulation 
includes much more features when compared to its deterministic counterpart, it is 
still suffering the influence of the boundary, affecting the position of the jet stream 
separation. Figure 5 shows the vertical ae Of Sonona y averaged temperature 
at different times for the three simulation, v e D) — 4 Lf AT (x,y,z, t) dxdy. 
The deterministic high resolution profile is D in green, the deterministic low 
resolution profile is plotted in orange and the stochastic low resolution simulation is 
plotted in dark red. From this plot it is difficult to assess the benefits of the stochastic 
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Fig. 5 Vertical profile of temperature at 5 equidistant points in time. In red, the stochastic 
simulation, in orange the deterministic simulation, in green the reference R27 simulation 


formalism to the temperature equation, as the stochastic and deterministic low 
resolutions simulation are close. However, no spurious vertical mixing is observed. 
This corresponds to an improvement with respect to [12], and is brought by the 
new methodology for noise generation detailed in the previous section. In particular, 
constraining the noise on the isopycnal surfaces tangent planes considerably reduces 
the spurious mixing in POD as well. Furthermore, the projection operator has the 
effect of localising the effects of the noise on the jet region. From a quantitative 
point of view the simulations are compared using the Root Mean Square 


RMS(f,) = y E [A]; (30) 


providing a measure of the energy content for the variable fy, and the Root Mean 
Square Error 


RMSE(f,) = / E [Mha] - ^? |. GI) 
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Table 2 Comparison of the t H (u? m v2) T 


values obtained with the =7 
diagnostic RMS(f ) R27d |3.25e—06 | 18.48e—02 | 6.72 
R3d 4.30e—07 | 1.69e—02 6.56 
R3LU | 1.62e—06 | 9.21e—02 6.56 
RMSE(f’) R3d 3.67e—06 | 17.17e—02 | 0.58 
R3LU | 4.18e—06 | 16.46e—02 | 0.54 
GRE R3d 547.78 2987.56 27.46 
R3LU | 2.92 1.46 7.49 


Boldface is used to highlight the best performance accord- 
ing to the metric. 


that in turns describes the energy content of the errors. In all previous equations the 
expected value is considered to be taken in the volume, i.e. E[f] = Y J, fav. 
Finally, defining F“ = I Jr f dt the time average and o; = I frf- fy. dt the 
time variance, the Relative Gaussian Entropy [21] (GRE) at a single point 


GRE = 


2 —t 2 2 
1| Fra- fM $ do (=) (32) 


2 ofm 9 iM ofM 
measures with a single criterion both the mean and variance reconstructions. The 
first term on the right-hand side of GRE represents the error in the mean weighted 
by the variance of the model. The remaining terms measure the error in model 
variability and is referred to as “dispersion”. The lower this criterion the better 
the reconstruction. It can be observed from (32) that this criterion is minimal 
if, for all points, the mean is perfectly reconstructed and if the variance of the 
reference equals the one of the coarse model tested. These quantitative measures 
have been evaluated for the mean component of three quantities, namely vorticity 
¢, the horizontal energy of the flow 1 (u? + v?) and temperature T'. The values of 
the proposed metrics for each simulation are given in Table 2. Figure 6 provides a 
visual representation of the behaviour of the fluctuations around the mean states of 
vorticity and energy, where the fluctuations are computed in time in a Reynolds 
splitting fashion, f/ = f — f. The first 2 years can be considered as the 
time required for some adjustment of the filtered and downsampled 1/9? initial 
condition. It can be outlined that, while the deterministic simulation cannot sustain 
the initial level of variability, its stochastic counterpart shows opposite behaviour, 
maintaining a higher variability. Concerning the benefits of the stochastic model to 
the dynamical quantities ¢ and 5 (u? + v’), all the presented results show that the 
stochastic model outperforms the deterministic simulation. The mean flow contains 
much more energy than the deterministic counterpart, as stated by higher values 
of RMS’), the RMSE(f') of the average fields seems to be reduced , with the 
exception of vorticity, for which a systematic bias in the positioning of the jet stream 
jeopardise the computation of the RMSE. In other words the L? norm is lower for 
a null vorticity field than for a vorticity field exhibiting clearly a meaningful jet 
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Fig. 6 Comparison between the fluctuations Root Mean Square (in space) along 10 years. In blue, 
the fluctuation RMS for the deterministic high resolution simulation (R27d). In green, the low 
resolution stochastic simulation (R3LU), in orange the low resolution deterministic simulation 
(R3d). The solid line shows for the three cases the values of vorticity, with corresponding scale on 
the left; the dotted line shows for the three cases the values of Energy, with corresponding scale on 
the right 


but with bias. The corresponding fluctuations show a higher RMS in time during 
the whole simulation, thus the stochastic simulation is energetically closer to the 
high resolution simulation. The distance between the stochastic simulation and the 
high resolution simulation, as measured by the GRE, is lower for both vorticity and 
energy when compared to a deterministic simulation. The effect of the stochastic 
parametrisation to the distribution of temperature T is of more difficult assessment, 
as the metrics show very similar behaviours. The distance measured by the GRE is 
lower than for the stochastic case, as much as the corresponding RMSE. However, 
the RMS of the mean temperature are very similar between deterministic and 
stochastic simulation. A comparison of the temperature fluctuations RMS in time 
(similar to that of Fig. 6 and not presented in this paper) confirms this similarity by 
showing no sensible difference between the simulations. 


6 Conclusions 


The considered stochastic model has been implemented in the ocean model NEMO. 
A dynamical mode decomposition based noise was considered and has been shown 
to be beneficial to improve the variability of the coarse resolution models and to 
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represent temporal statistics in a more accurate fashion. The intrinsic variability of 
the model has been greatly enhanced for dynamical variables as vorticity and energy, 
as much as the qualitative behaviour of both long time average and time-snapshots 
appearance. The same benefits do not seem to apply to thermodynamic quantities 
like temperature. 
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